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DETAILED ACTION 

1. The Amendment filed May 8, 2008 in response to the Office Action of January 10, 2008 
is acknowledged and has been entered. Previously pending claim 1 has been cancelled; claims 2, 
21, 22, 27, and 28 have been amended and new claim 29 have been added. 

2. Claims 2, 21, 22, 27, 28 and 29 are currently being examined. 

3. Applicant's request for reconsideration of the finality of the rejection of the last Office 

action is persuasive and, therefore, the finality of that action is withdrawn as set forth in the 

summary of the interview of 05 May 2008. 

New Grounds of Rejection 
Priority 

4. Applicant's claim for the benefit of a prior-filed application under 35 U.S.C. 1 19(e) or 
under 35 U.S.C. 120, 121, or 365(c) is acknowledged. Applicant has not complied with one or 
more conditions for receiving the benefit of an earlier filing date under 35 U.S.C. 1 19(e) as 
follows: 

The later-filed application must be an application for a patent for an invention which is 
also disclosed in the prior apphcation (the parent or original nonprovisional application or 
provisional application). The disclosure of the invention in the parent application and in the later- 
filed application must be sufficient to comply with the requirements of the first paragraph of 35 
U.S.C. 112. See Transco Products, Inc. v. Performance Contracting, Inc., 38 F.3d 551, 32 
USPQ2d 1077 (Fed. Cir. 1994). 

The disclosure of the prior-filed applications fails to provide adequate support or 
enablement in the manner provided by the first paragraph of 35 U.S.C. 1 12 for claims 21, 22, 27, 
and 28 of this application. Examiner has established a priority date of 1 1/17/2003 for claims 21, 
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22, 27, and 28 because the claims as currently constituted recite isolated nucleic molecules that 
are adapted to express or encode specific residues of SEQ ID NO: 5 and a review of the parent 
Applications does not reveal the claimed limitations, see the written description rejection in 
section 6 below. Additionally, the priority date of claims 2 and 29 is November 15, 2002, based 
on application 60/427,045 being the prior filed application in which a nucleic acid comprising 
SEQ ID NO: 4 is present, i.e. SEQ ID NO: 314 of application 60/427,045. Applicant is invited 
to submit evidence pointing to the serial number, page and line where support can be found 
establishing an earlier priority date. 

Claim Rejections - 35 USC § 112 
The following is a quotation of the second paragraph of 35 U.S.C. 112: 

The specification shall conclude with one or more claims pai-ticularly pointing out and distinctly claiming the 
subject matter which the applicant regards as his invention. 

5. Claims 2, 21, 22, 27, 28 and 29 are rejected under 35 U.S.C. 1 12, second 
paragraph, as being indefinite for failing to particularly point out and distinctly claim the subject 
matter which applicant regards as the invention. 

Claim 27, and its dependent claim 2, are indefinite in that claim 27 is drawn to an isolated 
nucleic acid molecule encoding residues 10,432 to 22,152 of SEQ ID N0:5 or a fragment of 
residues 10,432 to 22,152 of SEQ ID N0:5; wherein the isolated nucleic acid molecule is an 
expression vector and is adapted to express in a cell residues 10,432 to 22,152 of SEQ ID N0:5 
or a fragment of residues 10,432 to 22,152 of SEQ ID NO:5;wherein the fragment of residues 
10,432 to 22,152 of SEQ ID NO:5 is an antigenic fragment that can be used to make monoclonal 
antibodies that specifically recognize CA125 and cannot be determined if the claim is limited to 
isolated nucleotides only encoding and expressing the fragment of residues 10,432 to 22,152 of 
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SEQ ID N0:5 or a fragment thereof, or if the claims encompass nucleic acids that encode and 
express fragments larger than residues 10,432 to 22,152 of SEQ ID N0:5. 

It will be assumed for examination purposes that the claim is drawn to an isolated nucleic 
acid molecule encoding a polypeptide comprising residues 10,432 to 22,152 of SEQ ID N0:5 or 
a polypeptide comprising a fragment of residues 10,432 to 22,152 of SEQ ID N0:5; wherein the 
isolated nucleic acid molecule is an expression vector and is adapted to express in a cell residues 
a polypeptide comprising 10,432 to 22,152 of SEQ ID N0:5 or a polypeptide comprising a 
fragment of residues 10,432 to 22,152 of SEQ ID N0:5;wherein the fragment of residues 10,432 
to 22,152 of SEQ ID N0:5 is an antigenic fragment that can be used to make monoclonal 
antibodies that specifically recognize CA125. 

Claim 28, and its dependent claims, claims 21, 22, and 29, are indefinite in that claim 28 
is drawn to an isolated nucleic acid molecule encoding CA125 (SEQ ID N0:5) or a fragment 
thereof wherein the isolated nucleic acid molecule is an expression vector and is adapted to 
express in a cell CA125 (SEQ ID N0:5) or a fragment thereof . . . wherein the isolated nucleic 
acid molecule encodes residues 1 to 10,431 of SEQ ID N0:5 or a fragment of residues 1 to 
10,431 of SEQ ID N0:5, and it can not be determined if the isolated nucleic molecule is also 
adapted to express residues 1 to 10,431 of SEQ ID NO: 5 or a fragment of residues 1 to 10,431 
of SEQ ID N0:5 or if it is only required to encode them and not necessarily express them. In 
other words, it is unclear if the claim is limited to a polynucleotide adapted to express residues 1 
to 10,431 of SEQ ID NO: 5 or a fragment of residues 1 to 10,431 of SEQ ID N0:5 or if 
polynucleotides expressing fragments larger than residues 1 to 10,431 of SEQ ID N0:5 are 
encompassed by the claim. 
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It will be assumed for examination purposes that the isolated nucleic acid molecules are 
in an expression vector and are adapted to express in a cell SEQ ID NO: 5 or a fragment thereof 
that comprise a nucleic acid molecule that encodes residues 1 to 10,431 of SEQ ID N0:5 or a 
fragment of residues 1 to 10,431 of SEQ ID N0:5, wherein the fragment thereof is an antigenic 
fragment that can be used to make monoclonal antibodies that specifically recognize CA125 
(SEQ ID NO: 5). Thus, the claims are interpreted as being drawn to polynucleotides encoding 
and expressing a protein that comprises at least a fragment of residues 1 to 10,431 of SEQ ID 
N0:5. 

The following is a quotation of the first paragraph of 35 U.S.C. 1 12: 

The specification shall contain a written description of the invention, and of the manner and process of making 
and using it, in such full, clear, concise, and exact terms as to enable any person skilled in the art to which it 
pertains, or with which it is most nearly connected, to make and use the same and shall set forth the best mode 
contemplated by the inventor of carrying out his invention. 

6. Claims 21, 22, 27, 28 and 29 are rejected under 35 U.S.C. 1 12, first paragraph, as 

failing to comply with the written description requirement. The c]aim(s) contains subject matter 
which was not described in the specification in such a way as to reasonably convey to one skilled 
in the relevant art that the inventor(s), at the time the application was filed, had possession of the 
claimed invention. 

In regard to claim 27, the limitation of an isolated nucleic acid molecule encoding 
residues 10,432 to 22,152 of SEQ ID NO:5 or a fragment of residues 10,432 to 22,152 of SEQ 
ID N0:5; wherein the isolated nucleic acid molecule is an expression vector and is adapted to 
express in a cell residues 10,432 to 22,152 of SEQ ID N0:5 or a fragment of residues 10,432 to 
22,152 of SEQ ID N0:5;wherein the fragment of residues 10,432 to 22,152 of SEQ ID N0:5 is 
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an antigenic fragment that can be used to make monoclonal antibodies that specifically recognize 
CA125 has no clear support in the specification and the claims as originally filed. 

Apphcants argue that claim 27 is supported, e.g., by SEQ ID NO: 162 of parent 
provisional application serial no. 60/427,045, which is identical to residues 10,432-22,152 of 
SEQ ID NO: 5. 

The suggested support is not found persuasive because, although SEQ ID NO: 162 may 
be identical to residues 10,432-22,152 of SEQ ID N0:5, this polypeptide does not provide 
support for the genus of claimed nucleic acids encoding or adapted to express the specifically 
claimed residues of SEQ ID NO: 5 or fragments thereof. 

Applicants argue that claim 27 is also supported, e.g., by SEQ ID NOS:34, 36, and 38 in 
Tables 14, 16, and 18 of parent provisional patent application serial no. 60/299,380, which are 
disclosed to be the amino terminal domain, repeat domain, and carboxy terminal domain of 
CA125 and together make residues 10,432-22,152 of SEQ ID N0:5. 

The suggested support is not found persuasive because, although SEQ ID NOS:34, 36, 
and 38 may make residues 10,432-22,152 of SEQ ID N0:5, these individual polypeptides do not 
provide support for the genus of claimed nucleic acids encoding or adapted to express the 
specifically claimed residues of SEQ ID NO: 5 or fragments thereof. 

Applicants argue that claim 27 is also supported, e.g., by originally filed claims 1, 4, 14, 
and 15. Originally filed claims 14 and 15 disclose fragments of SEQ ID NO:5 and antibodies that 
bind to SEQ ID N0:5 and fragments thereof. 

The suggested support is not found persuasive. Originally filed claims 1, 4, 14, and 15 
are drawn to: 1. An isolated nucleic acid molecule encoding CA125. 4. The isolated nucleic acid 
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molecule of claim 2 wherein said molecule is a fragment thereof. 14. A polypeptide with the 
amino acid sequence selected from the group consisting of: (a) the amino acid sequence set forth 
in SEQ ID NO: 5; (b) an amino acid sequence having at least 50% sequence identity to said 
sequence; (c) a conservative variant of an one of (a) to (b); and (d) a fragment of any one of (a) 
to (c). 15. A purified antibody that selectively binds to an amino acid sequence of the CA125 
protein: (a) wherein the amino acid sequence of the CA125 protein comprises the amino acid 
sequence set forth in SEQ ID NO: 5; (b) an amino acid sequence having at least 50% sequence 
identity to said sequence; (c) a conservative variant of any one of (a) to (b); and (d) a fragment of 
any one of (a) to (c). Originally filed claims 1, 4, 14, and 15 do provide support for or suggest the 
genus of claimed nucleic acids encoding or adapted to express the specifically claimed residues 
of SEQ ID NO: 5 or fragments thereof. 

Applicants argue that claim 27 is also supported by provisional patent application serial 
no. 60/299,380 as follows. Pages 19-20 of provisional patent application serial no. 60/299,380 
discloses recombinant domains and epitopes of CA125 and antibodies against recombinant 
domains. Pages 6-7 of provisional patent application serial no. 60/299,380 disclose isolated 
nucleic acids and fragments of the nucleic acids isolated, and expressing isolated nucleic acids 
from vectors. Page 2, first line of the Summary of provisional patent application serial no. 
60/299,380 discloses isolating portions of the CA 125 gene. Page 3, lines 5-10 and page 4, line 3 
of provisional patent application serial no. 60/299,380 disclose use of recombinant domains, 
such as individual repeat units, of CA 125. Page 3, lines 15-18 of provisional patent application 
serial no. 60/299,380 disclose recombinant domains of CA125 encompassing epitope binding 
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sites for murine antibodies. There is thus abundant support for isolated nucleic acids used to 
express fragments of CA125 that can be used to generate antibodies that recognize CA125. 

The suggested support is not found persuasive because the cited passages of application 
serial no. 60/299,380 do not provide support for or suggest the claimed nucleic acid encoding or 
adapted to express the specifically claimed residues of SEQ ID NO: 5 or fragments thereof. 
Given the above the subject matter claimed in claim 27 broadens the scope of the invention as 
originally disclosed in the specification. 

In regard to claim 28 and its dependent claims 21, and 22, the limitation of the isolated 
nucleic acid molecule is an expression vector and is adapted to express in a cell CA125 (SEQ ID 
N0:5) or a fragment thereof . . . wherein the isolated nucleic acid molecule encodes residues 1 to 
10,431 of SEQ ID N0:5 or a fragment of residues 1 to 10,431 of SEQ ID N0:5 has no clear 
support in the specification and the claims as originally fUed. 

Applicants argue that claim 28 is supported, e.g., by SEQ ID NO: 310 of parent 
provisional patent application serial no. 60/427,045, which is identical to residues 1-10,431 of 
SEQ Id N0:5. 

The suggested support is not found persuasive because, although SEQ ID NO: 310 may 
be identical to residues 1-10,431 of SEQ ID N0:5, this polypeptide does not provide support for 
the genus of claimed nucleic acids encoding or adapted to express the specifically claimed 
residues of SEQ ID NO: 5 or fragments thereof. 

Applicants argue that claim 28 is also supported, e.g., by originally filed claims 1, 4, 14, 
and 15, by SEQ ID N0:5, and by paragraphs [0009], [0011], and [0041] of the specification, and 
by SEQ ID NOS: 1 and 4. Paragraph [0009] discloses that the extracellular amino terminal 
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domain is encoded by exons 1-9, as set out in SEQ ID NO: 1. It discloses that exon 4 is 
nucleotides 34575 to 38024 of SEQ ID NO:I. Paragraphs [0011] and [0041] disclose that the 
amino terminal extension comprises (is encoded by) four genomic exons [exons 1-4 described in 
paragraph 0009]. A comparison of the sequence of exon 4 (nucleotides 34575-38024 ofSEQ ID 
N0:1) and the cDNA of SEQ ID NO:4 reveals that exon 4 ends at nucleotide 31,485 of SEQ ID 
N0:4. A comparison of the sequences of exons 1-4 of SEQ ID NO: 1, the cDNA sequence of 
SEQ ID NO: 4, and the protein sequence of SEQ ID N0:5 reveals that exons 1-4 encode residues 
1-10,427 of SEQ ID N0:5. Applicants argue that Claim 21 is supported, e.g., by SEQ ID NO: 1 
and paragraph [0009]. Applicants argue that the element of fragments of SEQ ID N0:5 
recognized by an antibody that selectively binds to SEQ ID N0:5 is supported, e.g., by originally 
filed claim 15, part (d), and claim 14. 

The suggested support is not found persuasive. Originally filed claims 1, 4, 14, and 15 are 
drawn to: 1. An isolated nucleic acid molecule encoding CA125. 4. The isolated nucleic acid 
molecule of claim 2 wherein said molecule is a fragment thereof. 14. A polypeptide with the 
amino acid sequence selected from the group consisting of: (a) the amino acid sequence set forth 
in SEQ ID NO: 5; (b) an amino acid sequence having at least 50% sequence identity to said 
sequence; (c) a conservative variant of an one of (a) to (b); and (d) a fragment of any one of (a) 
to (c). 15. A purified antibody that selectively binds to an amino acid sequence of the CA125 
protein: (a) wherein the amino acid sequence of the CA125 protein comprises the amino acid 
sequence set forth in SEQ ID NO: 5; (b) an amino acid sequence having at least 50% sequence 
identity to said sequence; (c) a conservative variant of any one of (a) to (b); and (d) a fragment of 
any one of (a) to (c). Originally filed claims 1, 4, 14, and 15 do provide support for or suggest the 
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claimed nucleic acid encoding or adapted to express the specifically claimed residues of SEQ ID 
NO: 5 or fragments thereof. Additionally the polypeptide of SEQ ID NO: 5 does not support 
or suggest the genus of claimed nucleic acids encoding or adapted to express the specifically 
claimed residues of SEQ ID NO: 5 or fragments thereof. Additionally, the disclosure of the 
individual exons of CA125 and SEQ ID NO: 4, although exons 1-4 may encode residues 1- 
10,427 of SEQ ID N0:5, does not support or suggest the genus of claimed nucleic acids 
encoding or adapted to express the specifically claimed residues of SEQ ID NO: 5 or fragments 
thereof 

Additionally, although an isolated nucleic acid molecule comprising the sequence of SEQ 
ID NO: 1 in an expression vector would inherently express at least a fragment of residues 1- 
10,431 of SEQ ID N0:5, there is not inherent support for fragments of SEQ ID NO: 1 encoding 
and expressing residues 1-10,431 of SEQ ID N0:5 or fragments thereof. 

Given the above the subject matter claimed in claims 21, 22 and 28 broadens the scope of 
the invention as originally disclosed in the specification. 

Claim Rejections - 35 USC § 102 

The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that form the 
basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - 

(b) the invention was patented or described in a printed piiblicalioii in this or a foreign country or in public use or on 
sale in this country, more than one year prior to the date of application for patent in the United States. 

7. Claim 27 is rejected under 35 U.S.C. 102(b) as being anticipated by Yin and 
Lloyd (J. Biol. Chem. July 20, 2001. 276: 27371-27375, previously cited) and evidenced by 
appendix 1. 
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Yin and Lloyd teach cloning a C-terminal fragment of CA125 by screening a XZAF 
cDNA expression library of cDNA from OVCAR3 cells with an antibody to CA125, see 
Abstract, Materials and Methods, p. 27,372, and the Figures. Given that the XZAP cDNA vectors 
express fragments of cDNA that are detected by a CA125, Lin and Lloyd teach an isolated 
expression vector with a fragment of SEQ ID NO: 4 encoding a fragment of CA125 that is 
adapted to express in a cell a fragment of SEQ ID NO: 5 that is a fragment of residues 10,432 to 
22,152 of SEQ ID NO: 5, see appendix 1. 

Given that the polynucleotide of the prior art reference encodes a polypeptide that is 
recognized by an antibody to CA125, it would be expected that the encoded fragment could be 
used to make monoclonal antibodies that specifically recognize CA125. Although the reference 
does not specifically state that the isolated CA125 nucleotide fragment can be used to make 
monoclonal antibodies that specifically recognize CA125, the claimed product appears to be the 
same as the prior art product, absent a showing of unobvious differences. The office does not 
have the facilities and resources to provide the factual evidence needed in order to establish that 
the product of the prior art does not possess the same material, structural and functional 
characteristics of the claimed product. In the absence of evidence to the contrary, the burden is 
on the applicant to prove that the claimed product is different from those taught by the prior art 
and to establish patentable differences. See In re Best 562F.2d 1252, 195 USPQ 430 (CCPA). 

Applicants argue that claim 27 has a priority date of at least June 19, 2001. Claim 27 is 
fully supported by provisional patent application 60/299,380, which was filed June 19, 2001. 
SEQ ID NOS:34, 36, and 38 in Tables 14, 16, and 18 of parent provisional patent application 
serial no. 60/299,380 are disclosed to be the amino terminal domain, repeat domain, and carboxy 
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terminal domain of CA125. Together these make up residues 10,432-22,152 of SEQ ID NO: 5 as 
is recited in claim 27. Pages 6-7 of provisional patent application serial no. 60/299,380 disclose 
isolated nucleic acids and fragments of the nucleic acids isolated, and expressing isolated nucleic 
acids from vectors. Page 2, first line of the Summary of provisional patent application serial no. 
60/299,380 discloses isolating portions of the CA125 gene. Page 3, lines 5-10 and page 4, line 3 
of provisional patent application serial no. 60/299,380 discloses use of recombinant domains, 
such as individual repeat units, of CA125. Page 3, Unes 1 1-18 of provisional patent application 
serial no. 60/299,380 discloses recombinant domains of CA 125 encompassing epitope binding 
sites for murine antibodies, and use of the recombinant molecules as vaccines or to stimulate 
patients' immune systems. There is thus abundant support for expressing fragments of CA 125 
that can be used to generate antibodies that recognize CA125, as is recited in claim 27. The 
priority date of claim 27 is thus before Yin and Lloyd, and Yin and Lloyd is not prior art to claim 
27. 

Applicants arguments have been considered, but have not been found persuasive because, 
although SEQ ID NOS:34, 36, and 38 may make residues 10,432-22,152 of SEQ ID N0:5, these 
individual polypeptides do not provide support for the genus of claimed nucleic acids encoding 
or adapted to express the specifically claimed residues of SEQ ID NO: 5 or fragments thereof. 
Additionally, the cited passages of application serial no. 60/299,380 do not provide support for 
or suggest the claimed nucleic acid encoding or adapted to express the specifically claimed 
residues of SEQ ID NO: 5 or fragments thereof. Thus the priority date of claim 27 is 
1 1/17/2003 as set forth above. 



Application/Control Number: 10/715,066 Page 13 

Art Unit: 1643 

Applicants argue that Yin and Lloyd (3. Biol. Chem., July 20, 2001, 276:27371-27375) 
states that the authors isolated a 5797 -base pair sequence containing a stop codon but no clear 5' 
initiation sequence (abstract). And it is dated July 20, 2001. The alignment the Examiner shows, 
however, is with Genbank locus AF361486, which is 21,1 12 bp (not 5797 bp) and states that it 
was updated on Sept. 8, 2003. 

Applicant submits with this response in an Information Disclosure Statement the revision 
history of AF361486 and AF361486 version 1 GI: 1497 1109. The revision history shows that 
version GI: 14971 109 is the earliest version of AF361486 and was submitted on July 20, 2001. 
In the version submitted on July 20, 2001, AF361486 only had 5797 nucleotides, the same as 
Yin and Lloyd. The next revision of AF361486 was on Aug. 26, 2003. Version GI: 1497 1 109 
encodes an 1890-amino-acid protein that is homologous to the carboxy terminal 1890 amino acid 
residues of the present SEQ ID N0:5 and appears to be the protein sequence disclosed in Yin 
and Lloyd. The 21,1 12 bp sequence of the present AF361486 was only submitted on September 
8, 2003. 

Applicants arguments have been considered, but have not been found persuasive because 
the date of pubhcation of the Yin and Lloyed article and AF361486 is July 20, 2001, the priority 
date of claim 27 is 1 1/17/2003 as set forth above, and the sequence encodes and is adapted to 
express a fragment in a cell a fragment of SEQ ID NO: 5 that is a fragment of residues 10,432 to 
22,152 of SEQ ID NO: 5. 

8. Claim 27 is rejected under 35 U.S.C. 102(b) as being anticipated by O'Brien et al. 
(Tumor Biology 2001 Nov-Dec; 22(6):348-366, IDS item) as evidenced by O'Brien et al. 
(Tumor Biology 2002 May-Jun; 23(3): 154- 169, IDS item). 
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O'Brien et al. (2001) teach cloning a CA125 repeat domain into an expressing vector and 
expressing it in cells, see para, bridging page 349-350, and Fig. 4 and 5. O'Brien et al. (2002) 
teach that the repeat domains are with residues 10,432-22,152 of CA125, see Fig. 7. 

Although the reference does not specifically state that the isolated CA125 nucleotide 
fragment can be used to make monoclonal antibodies that specifically recognize CA125, the 
claimed product appears to be the same as the prior art product, absent a showing of unobvious 
differences. The office does not have the facilities and resources to provide the factual evidence 
needed in order to establish that the product of the prior art does not possess the same material, 
structural and functional characteristics of the claimed product. In the absence of evidence to the 
contrary, the burden is on the applicant to prove that the claimed product is different from those 
taught by the prior art and to establish patentable differences. See In re Best 562F.2d 1252, 195 
USPQ 430 (CCPA). 

9. Claims 21, 22, 27, and 28 are rejected under 35 U.S.C. 102(b) as being anticipated 
by WO 2002/68579 (Venter et al. 6 September 2002) as evidenced by Appendix 2. 

Venter et al. teach isolated nucleic acids encoding fragments of residues 1-10,427 and 10- 
432 to 22, 152, see Appendix 2. 

Venter et al. teach placing the isolated nucleic acids of the invention into vectors, see p. 
18, lines 4-15. Venter et al. teach vectors with promoters that modulate the expression of an 
operably linked sequence, see page 32, lines 1-26. Venter et al. teach producing proteins with the 
isolated nucleic acids of the invention, see p. 10 Unes 18-30 and page 34, lines 1-8. One of skill 
in the art would immediately recognize vectors with promoters that modulate the expression of 
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an operably linked sequence as an expression vector to be used for the expression and production 
of proteins from the isolated sequences. 

Although the reference does not specifically state that the isolated CA125 nucleotide 
fragment can be used to make monoclonal antibodies that specifically recognize CA125/SEQ ID 
NO: 5, the claimed product appears to be the same as the prior art product, absent a showing of 
unobvious differences. The office does not have the facilities and resources to provide the 
factual evidence needed in order to establish that the product of the prior art does not possess the 
same material, structural and functional characteristics of the claimed product. In the absence of 
evidence to the contrary, the burden is on the applicant to prove that the claimed product is 
different from those taught by the prior art and to establish patentable differences. See In re Best 
562F.2d 1252, 195 USPQ 430 (CCPA). 

10. Claim 27 is rejected under 35 U.S.C. 102(b) as being anticipated by WO 
2001/51513 (Algate et al. 19 July 2001) as evidenced by Appendix 3. 

Algate et al. teach isolated nucleic acids encoding fragments of residues 10,432 to 22, 
152, see Appendix 3. Algate et al. teach putting the nucleic acids of the invention into expression 
vectors encoding the polypeptides in host cells, see page 2, lines 11-15. 

Although the reference does not specifically state that the isolated CA125 nucleotide 
fragment can be used to make monoclonal antibodies that specifically recognize CA125, the 
claimed product appears to be the same as the prior art product, absent a showing of unobvious 
differences. The office does not have the facilities and resources to provide the factual evidence 
needed in order to establish that the product of the prior art does not possess the same material, 
structural and functional characteristics of the claimed product. In the absence of evidence to the 
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contrary, the burden is on the applicant to prove that the claimed product is different from those 
taught by the prior art and to establish patentable differences. See In re Best 562F.2d 1252, 195 
USPQ430 (CCPA). 

Double Patenting 

The nonstatutory double patenting rejection is based on a judicially created doctrine 
grounded in public policy (a policy reflected in the statute) so as to prevent the unjustified or 
improper timewise extension of the "right to exclude" granted by a patent and to prevent possible 
harassment by multiple assignees. A nonstatutory obviousness-type double patenting rejection 
is appropriate where the conflicting claims are not identical, but at least one examined 
application claim is not patentably distinct from the reference claim(s) because the examined 
application claim is either anticipated by, or would have been obvious over, the reference 
claim(s). See, e.g.. In re Berg, 140 F.3d 1428, 46 USPQ2d 1226 (Fed. Cir. 1998); In re 
Goodman, 11 F.3d 1046, 29 USPQ2d 2010 (Fed. Cir. 1993); In re Longi, 759 F.2d 887, 225 
USPQ 645 (Fed. Cir. 1985); In re Van Ornum, 686 F.2d 937, 214 USPQ 761 (CCPA 1982); In re 
Vogel, All F.2d 438, 164 USPQ 619 (CCPA 1970); and In re Thorington, 418 F.2d 528, 163 
USPQ 644 (CCPA 1969). 

A timely filed terminal disclaimer in compliance with 37 CFR 1.321(c) or 1.321(d) may 
be used to overcome an actual or provisional rejection based on a nonstatutory double patenting 
ground provided the conflicting application or patent either is shown to be commonly owned 
with this application, or claims an invention made as a result of activities undertaken within the 
scope of a joint research agreement. 

Effective January 1, 1994, a registered attorney or agent of record may sign a terminal 
disclaimer. A terminal disclaimer signed by the assignee must fully comply with 37 CFR 
3.73(b). 



11. Claims 2, 21, 22, 27, 28, and 29 are provisionally rejected on the ground of nonstatutory 
obviousness-type double patenting as being unpatentable over claim 48-50 and 52-55 of 
copending Application No. 1 1/975,668 in view of in view of US Patent No. 4,889,806 and 
Sambrook et al (Molecular Cloning, A Laboratory Manual, Cold Spring Harbor, 1989, p. 16.3- 
36). 

Claims 48-50 and 52-55 of Application No. 1 1/975, 668 are drawn to are drawn to a an 
isolated nucleic acid encoding a polypeptide comprising a fragment of CA125 (SEQ ID NO:315) 
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selected from the group consisting of: (i) residues 1-1637 of SEQ ID NO:299 and an antigenic 
fragment of residues 1-1637 of SEQ ID NO:299; (ii) a repeat unit selected from repeat units 1-63 
of Table 16; (iii) SEQ ID NOS: 164-191,195-208, 210-220, 222-225,227-247, 250-254, 256-276, 
and 278-293, ; and (iv) SEQ ID NO:300. The isolated nucleic acid of claim 50 wherein the 
nucleic acid encodes a polypeptide comprising SEQ ID NO:315. An isolated nucleic acid 
encoding a polypeptide comprising CA 125 (SEQ ID NO:315) or a fragment thereof; wherein the 
polypeptide comprises residues 1-10,427 of SEQ ID NO:310 or an antigenic fragment of 
residues 1-10,427 of SEQ ID NO:310. 

It is noted that SEQ ID NO: 315 of 1 1/975,668 is the full length CA125 protein which is 
identical to the full length CA125/ SEQ ID NO: 5 of the instant application and SEQ ID NO: 
314 1 1/975,668 is identical to SEQ ID NO: 4 of the instant application. Thus, SEQ ID NO: 4 is 
clearly contemplated as a polynucleotide encoding the CA125 protein. 

US Patent No. 4,889,806 teach that with the advent of recombinant DNA and molecular 
cloning technology it is now conventional to transfer genetic information into plasmids or 
vectors constructed in vitro and then transferred into host cells to be clonally propagated (col 1, 
lines 18-24). 

Sambrook et al teach that cloned genes are conventionally expressed using expression 
vectors and that expression of cloned proteins have been used to: (1) confirm the identity of a 
cloned gene by using immunological or functional assays to detect the encoded protein; (2) 
produce large amounts of proteins of biological interest that are normally available in only 
limited quantities from natural sources; (3) to study the biosynthesis and intracellular transport of 
proteins following their expression in various cell types; and (4) to elucidate structure-function 
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relationships by analyzing the properties of normal and mutant proteins (para bridging pages 
16.3 and 16.4). 

It would have been prima facie obvious to one of ordinary skill in the art at the time the 
invention was made to put the CA125 nucleic acids of copending Application No. 11/975, 668 
in expression vectors as taught by Sambrook et al and US Patent No. 4,889,806 because US 
Patent No. 4,889,806 specifically teaches that it is conventional to transfer genetic materials into 
plasmids or vectors and then transfer the plasmids or vectors into host cells and clonally 
propagate the genetic material and because Sambrook et al teach that cloned genes are 
conventionally expressed using expression vectors. One of ordinary skill in the art at the time 
the invention was made would have been motivated to put the sequences of U.S. Patent No. 6, 
261,836 in plasmid vectors operably linked to promoters as Sambrook et al and US Patent No. 
4,889,806 because Sambrook et al specifically teach that expressed cloned proteins are used to: 
(1) confirm the identity of a cloned gene by using immunological or functional assays to detect 
the encoded protein; (2) produce large amounts of proteins of biological interest that are 
normally available in only limited quantities from natural sources; (3) to study the biosynthesis 
and intracellular transport of proteins following their expression in various cell types; and (4) to 
elucidate structure-function relationships by analyzing the properties of normal and mutant 
proteins. 

This is a provisional obviousness-type double patenting rejection. 

12. All other objections and rejections recited in January 10, 2008 are withdrawn. 

13. No claims allowed. 
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14. Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Peter J. Reddig whose telephone number is (571) 272-9031. The 
examiner can normally be reached on M-F 8:30 a.m.-5:00 p.m. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Larry Helms can be reached on (571) 272-0832. The fax phone number for the 
organization where this application or proceeding is assigned is 571-273-8300. 

Information regarding the status of an apphcation may be obtained from the Patent 
Application Information Retrieval (PAIR) system. Status information for published applications 
may be obtained from either Private PAIR or Public PAIR. Status information for unpublished 
applications is available through Private PAIR only. For more information about the PAIR 
system, see http://pair-direct.uspto.gov. Should you have questions on access to the Private PAIR 
system, contact the Electronic Business Center (BBC) at 866-217-9197 (toll-free). If you would 
like assistance from a USPTO Customer Service Representative or access to the automated 
information system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 

/Peter J Reddig/ 

Examiner, Art Unit 1642 

/P. J. R./ 



/Karen A Canella/ 

Primary Examiner, Art Unit 1643 
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Appendix 1. 



LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



5 79 7 bp mRNA linear 
(MUC16) mRNA, partial cds . 



PRI 20-JUL-2001 



Chordata 



Craniata; Vertebrata; Euteleostomi; 
Catarrhini; Hominidae; Homo. 



AF361486 
Homo sapiens 
AF361486 

AF361486.1 GI:149711 

Homo sapiens (human) 
Ho"o gapien^ 
Eukaryota; Met; 
Mammalia; Eutheria; 

1 (bases 1 to 5797 
Yin,B.W. and Lloyd, K.O. 
Molecular cloning of the cal25 ovarii 
identification as a new mucin, mucl6 

J. Biol. Chem. 276 (29), 27371-27375 (2001) 

2 (bases 1 to 5797) 
Lloyd, K.O. and Yin, B.W.T. 
Direct Submission 

Submitted ( 15-MAR-2001 ) Sloan-Kettering Institute for Cancer 
Research, 1275 York Ave., New York, NY 10021, USA 
[WARNING] On Aug 26, 2003 this sequence was replaced by 
gi: 3 122 3 MO. 

Location /Qualifiers 
i 1..5797 

/organism="Homo sapiens" 

/mol_type="mRNA" 

/db_xref="taxon: 9606" 

<1..5797 

/gene="MUC16" 

<1 . . 5673 

/gene="MUC16" 

/note= "ovarian cancer antigen CA125" 
/codon_start=l 
/product="mucin 16" 
/protein_id="AAK74120 .1" 
/db_xref ="GI : 149 71110"""" 

/trans lation="RVDPIGPGLDRERLYWELSQLTNSITELGPYTLDRDSLYVHGFN 
PWSSVPTTSTPGTSTVHLATSGTPSSLPGHTAPVPLLIPFTLnFTITNLHYEEHMQHP 
GSRKFNTTERVLQGLLKPLFKSTSVGPLYSGCRLTLLRPEKHGAATGVDAICTLRLDP 
TGPGLDRERLYWELSQLTNSVTELGPYTLDRDSLYVNGFTHRSSVPTTSIPGTSAVHL 
ETSGTPASLPGHTAPGPLLVPFTLNFTITNLQYEEDMRHPGSRKFNTTERVLQGLLKP 
LFKSTSVGPLYSGCRLTLLRPEKRGAATGVDTICTHRLDPLNPGLDREQLYWELSKLT 
RGIIELGPYLLDRGSLYVNGFTHRNFVPITSTPGTSTVHLGTSETPSSLPRPIVPGPL 
LVPFTLNFTITNLQYEEAMRHPGSRKFNTTERVLQGLLRPLFKNTSIGPLYSSCRLTL 
LRPEKDKAATRVDAICTHHPDPQSPGLNREQLYWELSQLTHGITELGPYTLDRDSLYV 
DGFTHWSPIPTTSTPGTSIVNLGTSGIPPSLPETTATGPLLVPFTLNFTITNLQYEEN 
MGHPGSRKFNITESVLQGLLKPLFKSTSVGPLYSGCRLTLLRPEKDGVATRVDAICTH 
RPDPKIPGLDRQQLYWELSQLTHSITELGPYTLDRDSLYVNGFTQRSSVPTTSTPGTF 
TVQPETSETPSSLPGPTATGPVLLPFTLNFTIINLQYEEDMHRPGSRKFNTTERVLQG 
LLMPLFKNTSVSSLYSGCRLTLLRPEKDGAATRVDAVCTHRPDPKSPGLDRERLYWKL 
SQLTHGITELGPYTLDRHSLYVNGFTHQSSMTTTRTPDTSTMHLATSRTPASLSGPTT 
ASPLLVLFTINFTITNLRYEENMHHPGSRKFNTTERVLQGLLRPVFKNTSVGPLYSGC 
RLTLLRPKKDGAATKVDAICTYRPDPKSPGLDREQLYWELSQLTHSITELGPYTLDRD 
SLYVNGFTQRSSVPTTSIPGTPTVDLGTSGTPVSKPGPSAASPLLVLFTLNFTITNLR 
YEENMQHPGSRKFNTTERVLQGLLRSLFKSTSVGPLYSGCRLTLLRPEKDGTATGVDA 
ICTHHPDPKSPRLDREQLYWELSQLTHNITELGPYALDNDSLFVNGFTHRSSVSTTST 
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PGTPTVYLGASKTPASIFGPSAASHLLILFTLNFTITNLRYEENMWPGSRKFNTTERV 
LQGLLRPLFKNTSVGPLYSGCRLTLLRPEKDGEATGVDAICTHRPDPTGPGLDREQLY 
LELSQLTHSITELGPYTLDRDSLYVNGFTHRSSVPTTSTGWSEEPFTLNFTINNLRY 
MADMGQPGSLKFNITDNVMQHLLSPLFQRSSLGARYTGCRVIALRSVKNGAETRVDLL 
CTYLQPLSGPGLPIKQVFHELSQQTHGITRLGPYSLDKDSLYLNGYNEPGPDEPPTTP 
KPATTFLPPLSEATTAMGYHLKTLTLNFTISIILQYSPDMGKGSATFNSTEGVLQHLLR 
PLFQKSSMGPFYLGCQLISLRPEKDGAATGVDTTCTYHPDPVGPGLDIQQLYWELSQL 
THGVTQLGFYVLDRDSLFINGYAPQNLSIRGEYQINFHIVNWNLSNPDPTSSEYITLL 
RDIQDKVTTLYKGSQLHDTFRFCLVTNLTMDSVLVTVKALFSSNLDPSLVEQVFLDKT 
LNASFHWLGSTYQLVDIHVTEMESSVYQPTSSSSTQHFYPNFTITNLPYSQDBCAQPGT 
TNYQRNKRNIEDALNQLFRNSSIKSYFSDCQVSTFRSVPNRHHTGVDSLCNFSPLARR 
VDRVAIYEEFLRMTRNGTQLQNFTLDRSSVLVDGYSPNRNEPLTGNSDLPFWAVILIG 
LAGLLGLITCLICGVLVTTRRRKKEGEYNVQQQCPGYYQSHLDLEDLQ" 

ORIGIN 

1 cgcgttgatc ccatcggacc tggactggac agagagcggc tatactggga gctgagccag 
61 ctgaccaaca gcatcacaga gctgggaccc tacaccctgg atagggacag tctctatgtc 
121 aatggcttca acccttggag ctctgtgcca accaccagca ctcctgggac ctccacagtg 
181 cacctggcaa cctctgggac tccatcctcc ctgcctggcc acacagcccc tgtccctctc 

2 41 ttgataccat tcaccctcaa ctttaccatc accaacctgc attatgaaga aaacatgcaa 

3 01 caccctggtt ccaggaagtt caacaccacg gagagggttc tgcagggtct gctcaagccc 
361 ttgttcaaga gcaccagcgt tggccctctg tactctggct gcagactgac cttgctcaga 
421 cctgagaaac atggggcagc cactggagtg gacgccatct gcaccctccg ccttgatccc 
481 actggtcctg gactggacag agagcggcta tactgggagc tgagccagct gaccaacagc 
541 gttacagagc tgggccccta caccctggac agggacagtc tctatgtcaa tggcttcacc 
601 catcggagct ctgtgccaac caccagtatt cctgggacct ctgcagtgca cctggaaacc 
661 tctgggactc cagcctccct ccctggccac acagcccctg gccctctcct ggtgccattc 
721 accctcaact tcactatcac caacctgcag tatgaggagg acatgcgtca ccctggttcc 
781 aggaagttca acaccacgga gagagtcctg cagggtctgc tcaagccctt gttcaagagc 
841 accagtgttg gccctctgta ctctggctgc agactgacct tgctcaggcc tgaaaaacgt 
901 ggggcagcca ccggcgtgga caccatctgc actcaccgcc ttgaccctct aaaccctgga 
961 ctggacagag agcagctata ctgggagctg agcaaactga cccgtggcat catcgagctg 

1021 ggcccctacc tcctggacag aggcagtctc tatgtcaatg gtttcaccca tcggaacttt 
1081 gtgcccatca ccagcactcc tgggacctcc acagtacacc taggaacctc tgaaactcca 
1141 tcctccctac ctagacccat agtgcctggc cctctcctgg tgccattcac cctcaacttc 

12 01 accatcacca acttgcagta tgaggaggcc atgcgacacc ctggctccag gaagttcaat 
1261 accacggaga gggtcctaca gggtctgctc aggcccttgt tcaagaatac cagtatcggc 
1321 cctctgtact ccagctgcag actgaccttg ctcaggccag agaaggacaa ggcagccacc 

13 81 agagtggatg ccatctgtac ccaccaccct gaccctcaaa gccctggact gaacagagag 
1441 cagctgtact gggagctgag ccagctgacc cacggcatca ctgagctggg cccctacacc 
1501 ctggacaggg acagtctcta tgtcgatggt ttcactcatt ggagccccat accaaccacc 
1561 agcactcctg ggacctccat agtgaacctg ggaacctctg ggatcccacc ttccctccct 
1621 gaaactacag ccaccggccc tctcctggtg ccattcacac tcaacttcac catcactaac 
16 81 ctacagtatg aggagaacat gggtcaccct ggctccagga agttcaacat cacggagagt 
1741 gttctgcagg gtctgctcaa gcccttgttc aagagcacca gtgttggccc tctgtattct 
1801 ggctgcagac tgaccttgct caggcctgag aaggacggag tagccaccag agtggacgcc 
1861 atctgcaccc accgccctga ccccaaaatc cctgggctag acagacagca gctatactgg 
1921 gagctgagcc agctgaccca cagcatcact gagctgggac cctacaccct ggatagggac 
1981 agtctctatg tcaatggttt cacccagcgg agctctgtgc ccaccaccag cactcctggg 
2041 actttcacag tacagccgga aacctctgag actccatcat ccctccctgg ccccacagcc 
2101 actggccctg tcctgctgcc attcaccctc aattttacca tcattaacct gcagtatgag 
2161 gaggacatgc atcgccctgg ctccaggaag ttcaacacca cggagagggt ccttcagggt 
2221 ctgcttatgc ccttgttcaa gaacaccagt gtcagctctc tgtactctgg ttgcagactg 

22 81 accttgctca ggcctgagaa ggatggggca gccaccagag tggatgctgt ctgcacccat 

23 41 cgtcctgacc ccaaaagccc tggactggac agagagcggc tgtactggaa gctgagccag 
2401 ctgacccacg gcatcactga gctgggcccc tacaccctgg acaggcacag tctctatgtc 
2461 aatggtttca cccatcagag ctctatgacg accaccagaa ctcctgatac ctccacaatg 
2521 cacctggcaa cctcgagaac tccagcctcc ctgtctggac ctacgaccgc cagccctctc 
2581 ctggtgctat tcacaattaa cttcaccatc actaacctgc ggtatgagga gaacatgcat 
2641 caccctggct ctagaaagtt taacaccacg gagagagtcc ttcagggtct gctcaggcct 
2701 gtgttcaaga acaccagtgt tggccctctg tactctggct gcagactgac cttgctcagg 
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2761 cccaagaagg atggggcagc caccaaagtg gatgccatct gcacctaccg ccctgatccc 

2821 aaaagccctg gactggacag agagcagcta tactgggagc tgagccagct aacccacagc 

2881 atcactgagc tgggccccta caccctggac agggacagtc tctatgtcaa tggtttcaca 

2941 cagcggagct ctgtgcccac cactagcatt cctgggaccc ccacagtgga cctgggaaca 

3001 tctgggactc cagtttctaa acctggtccc tcggctgcca gccctctcct ggtgctattc 

3061 actctcaact tcaccatcac caacctgcgg tatgaggaga acatgcagca ccctggctcc 

3121 aggaagttca acaccacgga gagggtcctt cagggcctgc tcaggtccct gttcaagagc 

3181 accagtgttg gccctctgta ctctggctgc agactgactt tgctcaggcc tgaaaaggat 

3241 gggacagcca ctggagtgga tgccatctgc acccaccacc ctgaccccaa aagccctagg 

3301 ctggacagag agcagctgta ttgggagctg agccagctga cccacaatat cactgagctg 

3361 ggcccctatg ccctggacaa cgacagcctc tttgtcaatg gtttcactca tcggagctct 

3421 gtgtccacca ccagcactcc tgggaccccc acagtgtatc tgggagcatc taagactcca 

3481 gcctcgatat ttggcccttc agctgccagc catctcctga tactattcac cctcaacttc 

3541 accatcacta acctgcggta tgaggagaac atgtggcctg gctccaggaa gttcaacact 

36 01 acagagaggg tccttcaggg cctgctaagg cccttgttca agaacaccag tgttggccct 

3661 ctgtactctg gctgcaggct gaccttgctc aggccagaga aagatgggga agccaccgga 

3721 gtggatgcca tctgcaccca ccgccctgac cccacaggcc ctgggctgga cagagagcag 

3781 ctgtatttgg agctgagcca gctgacccac agcatcactg agctgggccc ctacacactg 

3841 gacagggaca gtctctatgt caatggtttc acccatcgga gctctgtacc caccaccagc 

3901 accggggtgg tcagcgagga gccattcaca ctgaacttca ccatcaacaa cctgcgctac 

3961 atggcggaca tgggccaacc cggctccctc aagttcaaca tcacagacaa cgtcatgcag 

4021 cacctgctca gtcctttgtt ccagaggagc agcctgggtg cacggtacac aggctgcagg 

4081 gtcatcgcac taaggtctgt gaagaacggt gctgagacac gggtggacct cctctgcacc 

4141 tacctgcagc ccctcagcgg cccaggtctg cctatcaagc aggtgttcca tgagctgagc 

4201 cagcagaccc atggcatcac ccggctgggc ccctactctc tggacaaaga cagcctctac 

4261 cttaacggtt acaatgaacc tggtccagat gagcctccta caactcccaa gccagccacc 

4321 acattcctgc ctcctctgtc agaagccaca acagccatgg ggtaccacct gaagaccctc 

4381 acactcaact tcaccatctc caatctccag tattcaccag atatgggcaa gggctcagct 

4441 acattcaact ccaccgaggg ggtccttcag cacctgctca gacccttgtt ccagaagagc 

4501 agcatgggcc ccttctactt gggttgccaa ctgatctccc tcaggcctga gaaggatggg 

4561 gcagccactg gtgtggacac cacctgcacc taccaccctg accctgtggg ccccgggctg 

4621 gacatacagc agctttactg ggagctgagt cagctgaccc atggtgtcac ccaactgggc 

46 81 ttctatgtcc tggacaggga tagcctcttc atcaatggct atgcacccca gaatttatca 

4741 atccggggcg agtaccagat aaatttccac attgtcaact ggaacctcag taatccagac 

4801 cccacatcct cagagtacat caccctgctg agggacatcc aggacaaggt caccacactc 

4861 tacaaaggca gtcaactaca tgacacattc cgcttctgcc tggtcaccaa cttgacgatg 

4921 gactccgtgt tggtcactgt caaggcattg ttctcctcca atttggaccc cagcctggtg 

49 81 gagcaagtct ttctagataa gaccctgaat gcctcattcc attggctggg ctccacctac 

5041 cagttggtgg acatccatgt gacagaaatg gagtcatcag tttatcaacc aacaagcagc 

5101 tccagcaccc agcacttcta cccgaatttc accatcacca acctaccata ttcccaggac 

5161 aaagcccagc caggcaccac caattaccag aggaacaaaa ggaatattga ggatgcgctc 

5221 aaccaactct tccgaaacag cagcatcaag agttattttt ctgactgtca agtttcaaca 

5281 ttcaggtctg tccccaacag gcaccacacc ggggtggact ccctgtgtaa cttctcgcca 

53 41 ctggctcgga gagtagacag agttgccatc tatgaggaat ttctgcggat gacccggaat 

5401 ggtacccagc tgcagaactt caccctggac aggagcagtg tccttgtgga tgggtattct 

5461 cccaacagaa atgagccctt aactgggaat tctgaccttc ccttctgggc tgtcatcctc 

5521 atcggcttgg caggactcct gggactcatc acatgcctga tctgcggtgt cctggtgacc 

5581 acccgccggc ggaagaagga aggagaatac aacgtccagc aacagtgccc aggctactac 

5641 cagtcacacc tagacctgga ggatctgcaa tgactggaac ttgccggtgc ctggggtgcc 

5701 tttcccccag ccagggtcca aagaagcttg gctggggcag aaataaacca tattggtcgg 

5761 i 



' . . ■: ' ' Ellis Mucin-16 (Ovarian carcinoma antigen CA125) (Ovarian 

caniir-reiited 
tumor marker CA125) (CA-125) 

Score = 3693 bits (9577), Expect =0.0, Method: Compositional matrix adjust. 

Identities = 1845/1890 (97%), Positives = 1846/1890 (97%), Gaps = 0/1890 (0%) 
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Query 1 RVDPIGPGLDRERLYWELSQLTNSITELGPYTLDRDSLYVNGFNPWSSVPTTSTPGTSTV 6 0 

R DP PGL E LYWELS LT I ELGPYTLDR SLYVNGF S PTTSTPGTSTV 

Sbjct 20263 RPDPKSPGLXXEXLYWELSXLTXXIXELGPYTLDRXSLYVNGFTHXXSXPTTSTPGTSTV 20322 

Query 61 HLATSGTPSSLPGHTAPVPLLIPFTLNFTITNLHYEENMQHPGSRKFNTTERVLQGLLKP 120 

TSGTPSS P TAPVPLLIPFTLNFTITNLHYEENMQHPGSRKFNTTERVLQGLLKP 

Sbjct 20323 XXGTSGTPSSXPXXTAPVPLLIPFTLNFTITNLHYEENMQHPGSRKFNTTERVLQGLLKP 20382 

Query 121 LFKSTSVGPLYSGCRLTLLRPEKHGAATGVDAICTLRLDPTGPGLDRERLYWELSQLTNS 180 

LFKSTSVGPLYSGCRLTLLRPEKHGAATGVDAICTLRLDPTGPGLDRERLYWELSQLTNS 

Sbjct 20383 LFKSTSVGPLYSGCRLTLLRPEKHGAATGVDAICTLRLDPTGPGLDRERLYWELSQLTNS 20442 

Query 181 VTELGPYTLDRDSLYVNGFTHRSSVPTTSIPGTSAVHLETSGTPASLPGHTAPGPLLVPF 240 

VTELGPYTLDRDSLYVNGFT RSSVPTTSIPGTSAVHLETSGTPASLPGHTAPGPLLVPF 

Sbjct 20443 VTELGPYTLDRDSLYVNGFTQRSSVPTTSIPGTSAVHLETSGTPASLPGHTAPGPLLVPF 20502 

Query 241 TLNFTITNLQYEEDMRHPGSRKFNTTERVLQGLLKPLFKSTSVGPLYSGCRLTLLRPEKR 300 

TLNFTITNLQYE DMRHPGSRKFNTTERVLQGLLKPLFKSTSVGPLYSGCRLTLLRPEKR 

Sbjct 20503 TLNFTITNLQYEVDMRHPGSRKFIITTERVLOGLLKPLFKSTSVGPLYSGCRLTLLRPEKR 20562 

Query 301 GAATGVDTICTHRLDPLNPGLDREOLYWELCKLTRGI lELGPYLLDRGSLYVllGFTHRNF 360 

GAATGVDTICTHRLDPLNPGLDREQLYWELSKLTRGIIELGPYLLDRGSLYVNGFTHRNF 

Sbjct 20563 GAATGVDTICTHRLDPLNPGLDREQLYWELSKLTRGIIELGPYLLDRGSLYVNGFTHRNF 20622 

Query 361 VPITSTPGTSTVHLGTSETPSSLPRPIVPGPLLVPFTLNFTITNLQYEEAMRHPGSRKFN 420 

VPITSTPGTSTVHLGTSETPSSLPRPIVPGPLLVPFTLNFTITNLQYEEAMRHPGSRKFN 

Sbjct 20623 VPITSTPGTSTVHLGTSETPSSLPRPIVPGPLLVPFTLNFTITHLQYEEAMRHPGSRKFII 20682 

Query 421 TTERVLQGLLRPLFKNTSIGPLYSSCRLTLLRPEKDKAATRVDAICTHHPDPQSPGLNRE 480 

TTERVLQGLLRPLFKNTSIGPLYSSCRLTLLRPEKDKAATRVDAICTHHPDPQSPGLNRE 

Sbjct 20683 TTERVLQGLLRPLFKNTSIGPLYSSCRLTLLRPEKDKAATRVDAICTHHPDPQSPGLNRE 20742 

Query 481 QLYWELSQLTHGITELGPYTLDRDSLYVDGFTHWSPIPTTSTPGTSIVNLGTSGIPPSLP 540 

QLYWELSQLTHGITELGPYTLDRDSLYVDGFTHWSPIPTTSTPGTSIVNLGTSGIPPSLP 

Sbjct 20743 QLYWELSQLTHGITELGPYTLDRDSLYVDGFTHWSPIPTTSTPGTSIVNLGTSGIPPSLP 20802 

Query 541 ETTATGPLLVPFTLNFTITNLQYEENMGHPGSRKFNITESVLQGLLKPLFKSTSVGPLYS 600 

ETT PLL PFT N TITNL M PGSRKFN TE VLQGLLKPLFKSTSVGPLYS 

Sbjct 20803 ETTXXXPLLXPFTXNXTITNLXXXXXMXXPGSRKFNTTERVLQGLLKPLFKSTSVGPLYS 20862 

Query 601 GCRLTLLRPEKDGVATRVDAICTHRPDPKIPGLDRQQLYWELSQLTHSITELGPYTLDRD 650 

GCRLTLLRPEKDGVATRVDAICTHRPDPKIPGLDRQQLYWELSQLTHSITELGPYTLDRD 

Sbjct 20863 GCRLTLLRPEKDGVATRVDAICTHRPDPKIPGLDRQQLYWELSQLTHSITELGPYTLDRD 20922 

Query 661 SLYVNGFTQRSSVPTTSTPGTFTVQPETSETPSSLPGPTATGPVLLPFTLNFTIINLQYE 720 

SLYVNGFTQRSSVPTTSTPGTFTVQPETSETPSSLPGPTATGPVLLPFTLNFTI NLQYE 

Sbjct 20923 SLYVNGFTQRSSVPTTSTPGTFTVQPETSETPSSLPGPTATGPVLLPFTLNFTITNLQYE 20982 

Query 721 EDMHRPGSRKFNTTERVLQGLLMPLFKNTSVSSLYSGCRLTLLRPEKDGAATRVDAVCTH 780 

EDMHRPGSRKFNTTERVLQGLLMPLFKNTSVSSLYSGCRLTLLRPEKDGAATRVDAVCTH 

Sbjct 20983 EDMHRPGSRKFNTTERVLQGLLMPLFKNTSVSSLYSGCRLTLLRPEKDGAATRVDAVCTH 21042 

Query 781 RPDPKSPGLDRERLYWKLSQLTHGITELGPYTLDRHSLYVNGFTHQSSMTTTRTPDTSTM 840 

RPDPKSPGLDRERLYWKLSQLTHGITELGPYTLDRHSLYVNGFTHQSSMTTTRTPDTSTM 

Sbjct 21043 RPDPKSPGLDRERLYWKLSQLTHGITELGPYTLDRHSLYVNGFTHQSSMTTTRTPDTSTM 21102 

Query 841 HLATSRTPASLSGPTTASPLLVLFTINFTITNLRYEENMHHPGSRKFNTTERVLQGLLRP 900 

HLATSRTPASLSGPTTASPLLVLFTINFTITNLRYEENMHHPGSRKFNTTERVLQGLLRP 

Sbjct 21103 HLATSRTPASLSGPTTASPLLVLFTINFTITNLRYEENMHHPGSRKFNTTERVLQGLLRP 21162 
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Query 901 VFKNTSVGPLYSGCRLTLLRPKKDGAATKVDAICTYRPDPKSPGLDREQLYWELSQLTHS 960 

VFKNTSVGPLYSGCRLTLLRPKKDGAATKVDAICTYRPDPKSPGLDREQLYWELSQLTHS 

Sbjct 21163 VFKNTSVGPLYSGCRLTLLRPKKDGAATKVDAICTYRPDPKSPGLDREQLYWELSQLTHS 21222 

Query 961 ITELGPYTLDRDSLYVNGFTQRSSVPTTSIPGTPTVDLGTSGTPVSKPGPSAASPLLVLF 1020 

ITELGPYTLDRDSLYVNGFTQRSSVPTTSIPGTPTVDLGTSGTPVSKPGPSAASPLLVLF 

Sbjct 21223 ITELGPYTLDRDSLYVNGFTQRSSVPTTSIPGTPTVDLGTSGTPVSKPGPSAASPLLVLF 21282 

Query 1021 TLNFTITNLRYEENMQHPGSRKFNTTERVLQGLLRSLFKSTSVGPLYSGCRLTLLRPEKD 1080 

TLNFTITNLRYEENMQHPGSRKFNTTERVLQGLLRSLFKSTSVGPLYSGCRLTLLRPEKD 

Sbjct 21283 TLNFTITNLRYEENMQHPGSRKFNTTERVLQGLLRSLFKSTSVGPLYSGCRLTLLRPEKD 21342 

Query 1081 GTATGVDAICTHHPDPKSPRLDREQLYKELSQLTHIIITELGPYALDIIDSLFVIIGFTHRSS 1140 

GTATGVDAICTHHPDPKSPRLDREQLYWELSQLTHIJITELG YALDIIDSLFVIIGFTHRSS 

Sbjct 21343 GTATGVDAICTHHPDPKSPRLDREQLYWELSQLTHNITELGHYALDNDSLFVNGFTHRSS 21402 

Query 1141 VSTTSTPGTPTVYLGASKTPASIFGPSAASHLLILFTLNFTITNLRYEENMWPGSRKFNT 1200 

VSTTSTPGTPTVYLGASKTPASIFGPSAASHLLILFTLNFTITNLRYEENMWPGSRKFNT 

Sbjct 21403 VSTTSTPGTPTVYLGASKTPASIFGPSAASHLLILFTLNFTITNLRYEENMWPGSRKFNT 21462 

Query 1201 TERVLQGLLRPLFKNTSVGPLYSGCRLTLLRPEKDGEATGVDAICTHRPDPTGPGLDREQ 1260 

TERVLQGLLRPLFKNTSVGPLYSG RLTLLRPEKDGEATGVDAICTHRPDPTGPGLDREQ 

Sbjct 21463 TERVLQGLLRPLFKNTSVGPLYSGSRLTLLRPEKDGEATGVDAICTHRPDPTGPGLDREQ 21522 

Query 1261 LYLELSQLTHSITELGPYTLDRDSLYVNGFTHRSSVPTTSTGWSEEPFTLNFTINNLRY 1320 

LYLELSQLTHSITELGPYTLDRDSLYVNGFTHRSSVPTTSTGWSEEPFTLNFTINNLRY 

Sbjct 21523 LYLELSQLTHSITELGPYTLDRDSLYVNGFTHRSSVPTTSTGVVSEEPFTLNFTINNLRY 21582 

Query 1321 MADMGQPGSLKFNITDNVMQHLLSPLFQRSSLGARYTGCRVIALRSVKNGAETRVDLLCT 1380 

MADMGQPGSLKFNITDNVM+HLLSPLFQRSSLGARYTGCRVIALRSVKNGAETRVDLLCT 

Sbjct 21583 MADMGQPGSLKFNITDNVMKHLLSPLFQRSSLGARYTGCRVIALRSVKNGAETRVDLLCT 21642 

Query 1381 YLQPLSGPGLPIKQVFHELSQQTHGITRLGPYSLDKDSLYLNGYNEPGPDEPPTTPKPAT 1440 

YLQPLSGPGLPIKQVFHELSQQTHGITRLGPYSLDKDSLYLNGYNEPG DEPPTTPKPAT 

Sbjct 21643 YLQPLSGPGLPIKQVFHELSQQTHGITRLGPYSLDKDSLYLNGYNEPGLDEPPTTPKPAT 21702 

Query 1441 TFLPPLSEATTAMGYHLKTLTLNFTISNLQYSPDMGKGSATFNSTEGVLQHLLRPLFQKS 1500 

TFLPPLSEATTAMGYHLKTLTLNFTISNLQYSPDMGKGSATFNSTEGVLQHLLRPLFQKS 

Sbjct 21703 TFLPPLSEATTAMGYHLKTLTLNFTISNLQYSPDMGKGSATFNSTEGVLQHLLRPLFQKS 21762 

Query 1501 SMGPFYLGCQLISLRPEKDGAATGVDTTCTYHPDPVGPGLDIQQLYWELSQLTHGVTQLG 1560 

SMGPFYLGCQLISLRPEKDGAATGVDTTCTYHPDPVGPGLDIQQLYWELSQLTHGVTQLG 

Sbjct 21763 SMGPFYLGCQLISLRPEKDGAATGVDTTCTYHPDPVGPGLDIQQLYWELSQLTHGVTQLG 21822 

Query 1561 FYVLDRDSLFINGYAPQNLSIRGEYQINFHIVNWNLSNPDPTSSEYITLLRDIQDKVTTL 1620 

FYVLDRDSLFINGYAPQNLSIRGEYQINFHIVNWNLSNPDPTSSEYITLLRDIQDKVTTL 

Sbjct 21823 FYVLDRDSLFINGYAPQNLSIRGEYQINFHIVNWNLSNPDPTSSEYITLLRDIQDKVTTL 21882 

Query 1621 YKGSQLHDTFRFCLVTNLTMDSVLVTVKALFSSNLDPSLVEQVFLDKTLNASFHWLGSTY 1680 

YKGSQLHDTFRFCLVTNLTMDSVLVTVKALFSSNLDPSLVEQVFLDKTLNASFHWLGSTY 

Sbjct 21883 YKGSQLHDTFRFCLVTNLTMDSVLVTVECALFSSNLDPSLVEQVFLDKTLNASFHWLGSTY 21942 

Query 1681 QLVDIHVTEMESSVYQPTSSSSTQHFYPNFTITNLPYSQDKAQPGTTNYQRNKRNIEDAL 1740 

QLVDIHVTEMESSVYQPTSSSSTQHFY NFTITNLPYSQDKAQPGTTNYQRNKRNIEDAL 

Sbjct 21943 QLVDIHVTEMESSVYQPTSSSSTQHFYLNFTITNLPYSQDKAQPGTTNYQRNKRNIEDAL 22002 

Query 1741 NQLFRNSSIKSYFSDCQVSTFRSVPNRHHTGVDSLCNFSPLARRVDRVAIYEEFLRMTRN 1800 

NQLFRNSSIKSYFSDCQVSTFRSVPNRHHTGVDSLCNFSPLARRVDRVAIYEEFLRMTRN 

Sbjct 22003 NQLFRNSSIKSYFSDCQVSTFRSVPNRHHTGVDSLCNFSPLARRVDRVAIYEEFLRMTRN 22062 
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Query 1801 GTQLQNFTLDRSSVLVDGYSPNRNEPLTGNSDLPFWAVILIGLAGLLGLITCLICGVLVT 1860 

GTQLQNFTLDRSSVLVDGYSPNRNEPLTGNSDLPFWAVILIGLAGLLGLITCLICGVLVT 
Sbjct 22063 GTQLQNFTLDRSSVLVDGYSPNRNEPLTGNSDLPFWAVILIGLAGLLGLITCLICGVLVT 22122 

Query 1861 TRRRKKEGEYNVQQQCPGYYQSHLDLEDLQ 1890 

TRRRKKEGEYNVQQQCPGYYQSHLDLEDLQ 
Sbjct 22123 TRRRKKEGEYNVQQQCPGYYQSHLDLEDLQ 22152 



Appendix 2 

A. Alignment of amino acids 1-1000 of SEQ ID NO: 5 to the sequences of 

Venter et al. 

AFS98121 

ID AFS98121 standard; DNA; 7521 BP. 
XX 

AC AFS98121; 
XX 

DT 20-SEP-2007 (first entry) 
XX 

DE Human transcript sequence, SEQ ID 17520. 

XX 

KW DNA detection; RNA detection; exon; ds . 
XX 

OS Homo sapiens . 
XX 

PN WO200268579-A2 . 
XX 

PD 06-SEP-2002. 
XX 

PF lO-JAN-2002; 2002WO-US000284 . 
XX 

PR lO-JAN-2001; 2001US-00756696 . 
XX 

PA (PEKE ) PE CORP NY. 
XX 

PI Venter CJ, Adams M, Li PWD, Myers EW; 
XX 

DR WPI; 2002-682812/73. 
XX 

PT New isolated nucleic acid detection reagent for detecting the presence of 

PT specified human exons. 

XX 

PS Claim 4; SEQ ID NO 17520; 40pp; English. 
XX 

CC The present invention relates to a novel isolated nucleic acid detection 

CC reagent for detecting the presence of specified human exons. The exon 

CC sequences cover every identified human transcript and exon comprising 

CC every gene/coding region of the human genome. The present sequence is one 

CC such exon sequence. The nucleic acid detection agent is used for 

CC detecting the presence of at least 100000, at least 2000, at least 50000 

CC or at least 10000 human exons. The sequences that span exon-exon 

CC junctions eliminate false signals caused by genomic contamination. This 

CC is because a detection element comprising two neighboring exons as one 

CC contiguous sequence will not hybridize to genomic DNA comprising 

CC intervening intronic DNA. These detection elements will only hybridize to 

CC expressed mRNA transcripts in which the exons are connected and the 

CC intronic sequence has been removed, therefore forming one contiguous 
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stretch of sequence corresponding to the sequence of the detect! 
element that spans the exon-exon junction. 

Sequence 7521 BP; 2172 A; 2277 C; 1433 G; 1639 T; 0 U; 0 Other; 



Alignment Scores: 
Pred. No. : 
Score: 

Percent Similarity: 
Best Local Similarity: 
Query Match: 



2 .6e-154 

4899.00 

100.0% 

100.0% 

98.0% 



Length : 
Matches: 
Conservative 
Mismatches : 
Indels : 



US-10-715-066A-5_COPY_1_1000 (1-1000) x AFS98121 (1-7521) 

Qy 21 SerArgSerThrLysAlaThrProGluMetAspSerGlyLeuThrGlyAlaThrLeuSer 40 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3 AGCAGGAGCACTAAAGCCACACCAGAAATGGATTCAGGACTGACAGGAGCCACCTTGTCA 62 

Qy 41 ProLysThrSerThrGlyAlalleValValThrGluHisThrLeuProPheThrSerPro 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 63 CCTAAGACATCTACAGGTGCAATCGTGGTGACAGAACATACTCTGCCCTTTACTTCCCCA 122 

Qy 61 AspLysThrLeuAlaSerProThrSerSerValValGlyArgThrThrGlnSerLeuGly 80 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 123 GAT7VAGACCTTGGCCAGTCCTACATCTTCGGTTGTGGGAAGAACCACCCAGTCTTTGGGG 182 



81 ValMetSerSerAlaLeuProGluSerThrSerArgGlyMetThrHisSerGluGlnArg 100 
183 GTGATGTCCTCTGCTCTCCCTGAGTCAACCTCTAGAGGAATGACACACTCCGAGCAAAGA 242 



101 ThrSerProSerLeuSerProGlnValAsnGlyThrProSerArgAsnTyrProAlaThr 120 

2 43 ACCAGCCCATCGCTGAGTCCCCAGGTCAATGGAACTCCCTCTAGGAACTACCCTGCTACA 3 02 
121 SerMetValSerGlyLeuSerSerProArgThrArgThrSerSerThrGluGlyAonPhe 140 

3 03 AGCATGGTTTCAGGATTGAGTTCCCCAAGGACCAGGACCAGTTCCACAGAAGGAAATTTT 36 2 

141 ThrLysGluAlaSerThrTyrThrLeuThrValGluThrThrSerGlyProValThrGlu 16 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

363 ACCAAAGAAGCATCTACATACACACTCACTGTAGAGACCACAAGTGGCCCAGTCACTGAG 422 

161 LysTyrThrValProThrGluThrSerThrThrGluGlyAspSerThrGluThrProTrp 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

423 AAGTACACAGTCCCCACTGAGACCTCAACAACTGAAGGTGACAGCACAGAGACCCCCTGG 482 

181 AspThrArgTyrlleProValLysIleThrSerProMetLysThrPheAlaAspSerThr 200 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

483 GACACAAGATATATTCCTGTAAAAATCACATCTCCAATGAAAACATTTGCAGATTCAACT 542 

2 01 AlaSerLysGluAsnAlaProValSerMetThrProAlaGluThrThrValThrAspSer 22 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

5 43 GCATCCAAGGAAAATGCCCCAGTGTCTATGACTCCAGCTGAGACCACAGTTACTGACTCA 6 02 

221 HisThrProGlyArgThrAsnProSerPheGlyThrLeuTyrSerSerPheLeuAspLeu 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

603 CATACTCCAGGAAGGACAAACCCATCATTTGGGACACTTTATTCTTCCTTCCTTGACCTA 662 

241 SerProLysGlyThrProAsnSerArgGlyGluThrSerLeuGluLeuIleLeuSerThr 260 
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663 TCACCTAAAGGGACCCCAAATTCCAGAGGTGAAACAAGCCTGGAACTGATTCTATCAACC 722 

261 ThrGlYTyrProPheSerSerProGluProGlySerAlaGlyHisSerArglleSerThr 280 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

723 ACTGGATATCCCTTCTCCTCTCCTGAACCTGGCTCTGCAGGACACAGCAGAATAAGTACC 782 

2 81 SerAlaProLeuSerSerSerAlaSerValLeuAspAsnLysIleSerGluThrSerlle 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

783 AGTGCGCCTTTGTCATCATCTGCTTCAGTTCTCGATAATAAAATATCAGAGACCAGCATA 842 

301 PheSerGlyGlnSerLeuThrSerProLeuSerProGlyValProGluAlaArgAlaSer 320 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

843 TTCTCAGGCCAGAGTCTCACCTCCCCTCTGTCTCCTGGGGTGCCCGAGGCCAGAGCCAGC 902 

321 ThrMetProAsnSerAlalleProPheSerMetThrLeuSerAsnAlaGluThrSerAla 340 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

903 ACAATGCCCAACTCAGCTATCCCTTTTTCCATGACACTAAGCAATGCAGAAACAAGTGCC 962 

3 41 GluArgValArgSerThrlleSerSerLeuGlyThrProSerlleSerThrLycGlnThr 36 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

963 GAAAGGGTCAGAAGCACAATTTCCTCTCTGGGGACTCCATCAATATCCACAAAGCAGACA 1022 

361 AlaGluThrlleLeuThrPheHisAlaPheAlaGluThrMetAspIleProSerThrHis 380 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

1023 GCAGAGACTATCCTTACCTTCCATGCCTTCGCTGAGACCATGGATATACCCAGCACCCAC 1082 

381 IleAlaLysThrLeuAlaSerGluTrpLeuGlySerProGlyThrLeuGlyGlyThrSer 400 

1083 ATAGCCAAGACTTTGGCTTCAGAATGGTTGGGAAGTCCAGGTACCCTTGGTGGCACCAGC 1142 

401 ThrSerAlaLeuThrThrThrSerProSerThrThrLeuValSerGluGluThrAsnThr 420 

1143 ACTTCAGCGCTGACAACCACATCTCCATCTACCACTTTAGTCTCAGAGGAGACCAACACC 12 02 

421 HioHisSerThrSerGlyLysGluThrGluGlyThrLeuAonThrGerMetThrProLeu 44 0 

12 03 CATCACTCCACGAGTGGAAAGGAGACAGAAGGAACTTTGAATACATCTATGACTCCACTT 12 62 

441 GluThrSerAlaProGlyGluGluSerGluMetThrAlaThrLeuValProThrLeuGly 46 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

1263 GAGACCTCTGCTCCTGGAGAAGAGTCCGAAATGACTGCCACCTTGGTCCCCACTCTAGGT 1322 

461 PheThrThrLeuAspSerLysIleArgSerProSerGlnValSerSerSerHisProThr 480 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

1323 TTTACAACTCTTGACAGCAAGATCAGAAGTCCATCTCAGGTCTCTTCATCCCACCCAACA 1382 

481 ArgGluLeuArgThrThrGlySerThrSerGlyArgGlnSerSerSerThrAlaAlaHis 500 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

1383 AGAGAGCTCAGAACCACAGGCAGCACCTCTGGGAGGCAGAGTTCCAGCACAGCTGCCCAC 1442 

5 01 GlySerSerAspIleLeuArgAlaThrThrSerSerThrSerLysAlaSerSerTrpThr 52 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

14 43 GGGAGCTCTGACATCCTGAGGGCAACCACTTCCAGCACCTCAAAAGCATCATCATGGACC 15 02 

521 SerGluSerThrAlaGlnGlnPheSerGluProGlnHisThrGlnTrpValGluThrSer 540 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

1503 AGTGAAAGCACAGCTCAGCAATTTAGTGAACCCCAGCACACACAGTGGGTGGAGACAAGT 1562 

541 ProSerMetLysThrGluArgProProAlaSerThrSerValAlaAlaProIleThrThr 560 
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1563 CCTAGCATGAAAACAGAGAGACCCCCAGCATCAACCAGTGTGGCAGCCCCTATCACCACT 1622 

561 SerValProSerValValSerGlyPheThrThrLeuLYsThrSerSerThrLysGlylle 580 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

16 23 TCTGTTCCCTCAGTGGTCTCTGGCTTCACCACCCTGAAGACCAGCTCCACAAAAGGGATT 16 82 

581 TrpLeuGluGluThrSerAlaAspThrLeuIleGlyGluSerThrAlaGlyProThrThr 600 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1683 TGGCTTGAAGAAACATCTGCAGACACACTCATCGGAGAATCCACAGCTGGCCCAACCACC 1742 

601 HisGlnPheAlaValProThrGlylleSerMetThrGlyGlySerSerThrArgGlySer 620 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1743 CATCAGTTTGCTGTTCCCACTGGGATTTCAATGACAGGAGGCAGCAGCACCAGGGGAAGC 1802 

621 GlnGlyThrThrHisLeuLeuThrArgAlaThrAlaSerSerGluThrSerAlaAspLeu 640 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1803 CAGGGCACAACCCACCTACTCACCAGAGCCACAGCATCATCTGAGACATCCGCAGATTTG 1862 

6 41 ThrLeuAlaThrAsnGlyValProValSerValSerProAlaValSerLYsThrAlaAla 66 0 

1863 ACTCTGGCCACGAACGGTGTCCCAGTCTCCGTGTCTCCAGCAGTGAGCAAGACGGCTGCT 1922 

661 GlySerSerProProGlyGlyThrLysProSerTyrThrMetValSerSerValllePro 680 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1923 GGCTCAAGTCCTCCAGGAGGGACAAAGCCATCATATACAATGGTTTCTTCTGTCATCCCT 1982 

681 GluThrSerSerLeuGlnSerSerAlaPheArgGluGlyThrSerLeuGlyLeuThrPro 700 

1983 GAGACATCATCTCTACAGTCCTCAGCTTTCAGGGAAGGAACCAGCCTGGGACTGACTCCA 2042 

701 LeuAsnThrArgHisProPheSerSerProGluProAspSerAlaGlyHisThrLysIle 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

2043 TTAAACACTAGACATCCCTTCTCTTCCCCTGAACCAGACTCTGCAGGACACACCAAGATA 2102 

721 GerThrGerlleProLeuLeuGerGerAlaSerValLeuGluAspLysValSerAlaThr 74 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
2103 AGCACCAGCATTCCTCTGTTGTCATCTGCTTCAGTTCTTGAGGATAAAGTGTCAGCGACC 2162 

741 SerThrPheSerHisHisLysAlaThrSerSerlleThrThrGlyThrProGluIleSer 76 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
2163 AGCACATTCTCACACCACAAAGCCACCTCATCTATTACCACAGGGACTCCTGAAATCTCA 2222 

761 ThrLysThrLysProSerSerAlaValLeuSerSerMetThrLeuSerAsnAlaAlaThr 780 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
2223 ACAAAGACAAAGCCCAGCTCAGCCGTTCTTTCCTCCATGACCCTAAGCAATGCAGCAACA 2282 

781 SerProGluArgValArgAsnAlaThrSerProLeuThrHisProSerProSerGlyGlu 800 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
2283 AGTCCTGAAAGAGTCAGAAATGCAACTTCCCCTCTGACTCATCCATCTCCATCAGGGGAA 2342 

8 01 GluThrAlaGlySerValLeuThrLeuSerThrSerAlaGluThrThrAspSerProAsn 82 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
23 43 GAGACAGCAGGGAGTGTCCTCACTCTCAGCACCTCTGCTGAGACTACAGACTCACCTAAC 2402 

821 IleHisProThrGlyThrLeuThrSerGluSerSerGluSerProSerThrLeuSerLeu 840 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
2403 ATCCACCCAACTGGGACACTGACTTCAGAATCGTCAGAGAGTCCTAGCACTCTCAGCCTC 2462 

841 ProSerValSerGlyValLysThrThrPheSerSerSerThrProSerThrHisLeuPhe 860 
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Db 2463 CCAAGTGTCTCTGGAGTCAAAACCACATTTTCTTCATCTACTCCTTCCACTCATCTATTT 2522 

Qy 861 ThrSerGlyGluGluThrGluGluThrSerAsnProSerValSerGlnProGluThrSer 880 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2523 ACTAGTGGAGAAGAAACAGAGGAAACTTCGAATCCATCTGTGTCTCAACCTGAGACTTCT 2582 

Qy 881 ValSerArgValArgThrThrLeuAlaSerThrSerValProThrProValPheProThr 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2583 GTTTCCAGAGTAAGGACCACCTTGGCCAGCACCTCTGTCCCTACCCCAGTATTCCCCACC 2642 

Qy 901 MetAspThrTrpProThrArgSerAlaGlnPheSerSerSerHisLeuValSerGluLeu 920 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2643 ATGGACACCTGGCCTACACGTTCAGCTCAGTTCTCTTCATCCCACCTAGTGAGTGAGCTC 2702 

Qy 921 ArgAlaThrSerSerThrSerValThrAsnSerThrGlySerAlaLeuProLysIleSer 940 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2703 AGAGCTACGAGCAGTACCTCAGTTACAAACTCAACTGGTTCAGCTCTTCCTAAAATATCT 2762 

Qy 9 41 HisLeuThrGlyThrAlaThrMetSerGlnThrAsnArgAspThrPheAsnAspSerAla 96 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2763 CACCTCACTGGGACGGCAACAATGTCACAGACCAATAGAGACACGTTTAATGACTCTGCT 2822 

Qy 961 AlaProGlnSerThrThrTrpProGluThrSerProArgPheLysThrGlyLeuProSer 980 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2823 GCACCCCAAAGCACAACTTGGCCAGAGACTAGTCCCAGATTCAAGACAGGGTTACCTTCA 2882 

Qy 981 AlaThrThrThrValSerThrSerAlaThrSerLeuSerAlaThrValMetValSerLys 1000 

Db 2883 GCAACAACCACTGTTTCAACCTCTGCCACTTCTCTCTCTGCTACTGTAATGGTCTCTAAA 2942 

B. Alignment of amino acids 10,000-12,000 of SEQ ID NO: 5 to the sequences 
of Venter et al . 
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New isolated nucleic acid detectie 
specified human exons. 



I reagent for detecting the presence of 



Cla 



1 4; SEQ ID NO 23771; 40pp; Engli; 



The present invention relates to a novel isolated nucleic acid detection 
reagent for detecting the presence of specified human exons. The exon 
sequences cover every identified human transcript and exon comprising 
every gene/coding region of the human genome. The present sequence is on< 
such exon sequence. The nucleic acid detection agent is used for 
detecting the presence of at least 100000, at least 2000, at least 50000 
or at least 10000 human exons. The sequences that span exon-exon 
junctions eliminate false signals caused by genomic contamination. This 
is because a detection element comprising two neighboring exons as one 
contiguous sequence will not hybridize to genomic DNA comprising 
intervening intronic DNA. These detection elements will only hybridi 
expressed mRNA transcripts in which the exons are connected and the 
intronic sequence has been removed, therefore forming one contiguous 
stretch of sequence corresponding to the sequence of the detection 
element that spans the exon-exon junction. 

Sequence 25080 BP; 6952 A; 7729 C; 4583 G; 5816 T; 0 U; 0 Other; 



to 



SQ 

Alignment Sco] 
Pred. No. : 

Best Local Sii 
Query Match: 
DB: 



Length: 

Conservative : 
Mismatches : 
Indels: 



US-10-715-066A-5_COPY_10000_12000 (1-2001) x AFT04373 (1-25080) 

Qy 1 ThrLeuMetSerArgSerProGluAsnProSerTrpLysSerSerProPheValGluLys 20 

Db 18568 ACTCTTATGAGTAGGAGTCCTGAAAATCCATCATGGAAGAGCTCTCCCTTTGTGGAAAAA 18627 

Qy 21 ThrSerSerSerSerSerLeuLeuSerLeuProValThrThrSerProSerValSerSer 40 

Db 18628 ACTAGCTCTTCATCTTCTCTGTTGTCCTTACCTGTCACGACCTCACCTTCTGTTTCTTCC 18687 

Qy 41 ThrLeuProGlnSerlleProSerSerSerPheSerValThrSerLeuLeuThrProGly 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I ! M I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 18688 ACATTACCGCAGAGTATCCCTTCCTCCTCTTTTTCTGTGACTTCACTCCTCACCCCAGGC 18747 

Qy 61 MetValLysThrThrAspThrSerThrGluProGlyThrSerLeuSerProAsnLeuSer 80 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 18748 ATGGTGAAGACTACAGACACAAGCACAGAACCTGGAACCAGTTTATCTCCAAATCTGAGT 18807 

Qy 81 GlyThrSerValGluIleLeuAlaAlaSerGluValThrThrAspThrGluLysIleHis 100 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 18808 GGCACCTCAGTTGAAATACTGGCTGCCTCTGAAGTCACCACAGATACAGAGAAAATTCAT 18867 

Qy 101 ProSerSerSerMetAlaValThrAsnValGlyThrThrSerSerGlyHisGluLeuTyr 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 18868 CCTTCTTCAAGCATGGCAGTGACCAATGTGGGAACCACCAGTTCTGGACATGAACTATAT 18927 



121 SerSerValSerlleHisSerGluProSerLysAlaThrTyrProValGlyThrProSer 140 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



Application/Control Number: 10/715,066 



Page 31 



Art Unit: 1643 



18928 TCCTCTGTTTCAATCCACTCGGAGCCATCCAAGGCTACATACCCAGTGGGTACTCCCTCT 18987 

141 SerMetAlaGluThrSerlleSerThrSerMetProAlaAsnPheGluThrThrGlyPhe 160 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
18988 TCCATGGCTGAAACCTCTATTTCCACATCAATGCCTGCTAATTTTGAGACCACAGGATTT 19047 

161 GluAlaGluProPheSerHisLeuThrSerGlyPheArgLysThrAsnMetSerLeuAsp 180 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
19048 GAGGCTGAGCCATTTTCTCATTTGACTTCTGGACTTAGGAAGACCAACATGTCCCTGGAC 19107 

181 ThrSerSerValThrProThrAsnThrProSerSerProGlySerThrHisLeuLeuGln 200 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

19108 ACCAGCTCAGTCACACCAACAAATACACCTTCTTCTCCTGGGTCCACTCACCTTTTACAG 1916 7 

2 01 SerSerLysThrAspPheThrSerSerAlaLysThrSerSerProAspTrpProProAla 22 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
19168 AGTTCCAAGACTGATTTCACCTCTTCTGCAAAAACATCATCCCCAGACTGGCCTCCAGCC 1922 7 

221 SerGlnTyrThrGluIleProValAspIlelleThrProPheAsnAlaSerProSerlle 240 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
19228 TCACAGTATACTGAAATTCCAGTGGACATAATCACCCCCTTTAATGCTTCTCCATCTATT 19287 

241 ThrGluSerThrGlylleThrSerPheProGluSerArgPheThrMetSerValThrGlu 260 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
19288 ACGGAGTCCACTGGGATAACCTCCTTCCCAGAATCCAGGTTTACTATGTCTGTAACAGAA 19347 

261 SerThrHisHisLeuSerThrAspLeuLeuProSerAlaGluThrlleSerThrGlyThr 280 

193 48 AGTACTCATCATCTGAGTACAGATTTGCTGCCTTCAGCTGAGACTATTTCCACTGGCACA 19 40 7 

281 ValMetProSerLeuSerGluAlaMetThrSerPheAlaThrThrGlyValProArgAla 300 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
19408 GTGATGCCTTCTCTATCAGAGGCCATGACTTCATTTGCCACCACTGGAGTTCCACGAGCC 1946 7 

301 IleSerGlySerGlySerProPheSerArgThrGluSerGlyProGlyAspAlaThrLeu 320 
I I I I I I I I I I I I I I I 

19468 ATCTCAGGTTCAGGA 19482 

321 SerThrlleAlaGluSerLeuProSerSerThrProValProPheSerSerSerThrPhe 340 

19482 19482 

3 41 ThrThrThrAspSerSerThrlleProAlaLeuHisGluIleThrSerSerSerAlaThr 36 0 
19482 19482 

361 ProTyrArgValAspThrSerLeuGlyThrGluSerSerThrThrGluGlyArgLeuVal 380 

19482 19482 

381 MetValSerThrLeuAspThrSerSerGlnProGlyArgThrSerSerThrProIleLeu 400 

19482 19482 

401 AspThrArgMetThrGluSerValGluLeuGlyThrValThrSerAlaTyrGlnValPro 420 

19482 19482 

421 SerLeuSerThrArgLeuThrArgThrAspGlylleMetGluHisIleThrLysIlePro 440 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
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19483 ACTGATGGCATTATGGAACACATCACAAAAATACCC 19518 

441 AsnGluAlaAlaHisArgGlyThrlleArgProValLysGlyProGlnThrSerThrSer 460 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
19519 AATGAAGCAGCACACAGAGGTACCATAAGACCAGTCAAAGGCCCTCAGACATCCACTTCG 19578 

461 ProAlaSerProLYsGlYLeuHisThrGlYGlyThrLysArgMetGluThrThrThrThr 480 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
19579 CCTGCCAGTCCTAAAGGACTACACACAGGAGGGACAAAAAGAATGGAGACCACCACCACA 19638 

481 AlaLeuLysThrThrThrThrAlaLeuLysThrThrSerArgAlaThrLeuThrThrSer 500 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

196 3 9 GCTCTGAAGACCACCACCACAGCTCTGAAGACCACTTCCAGAGCCACCTTGACCACCAGT 19 6 9 8 

501 ValTyrThrProThrLeuGlyThrLeuThrProLeuAsnAlaSerArgGlnMetAlaSer 52 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
19699 GTCTATACTCCCACTTTGGGAACACTGACTCCCCTCAATGCATCAATGCAAATGGCCAGC 19758 

521 ThrlleLeuThrGluMetMetlleThrThrProTyrValPheProAspValProGluThr 540 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
19759 ACAATCCCCACAGAAATGATGATCACAACCCCATATGTTTTCCCTGATGTTCCAGAAACG 19818 

541 ThrSerSerLeuAlaThrSerLeuGlyAlaGluThrSerThrAlaLeuProArgThrThr 560 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
19819 ACATCCTCATTGGCTACCAGCCTGGGAGCAGAAACCAGCACAGCTCTTCCCAGGACAACC 19878 

561 ProSerValLeuAsnArgGluSerGluThrThrAlaSerLeuValSerArgSerGlyAla 580 

198 79 CCATCTGTTTTCAATAGAGAATCAGAGACCACAGCCTCACTGGTCTCTCGTTCTGGGGCA 19938 

581 GluArgSerProVallleGlnThrLeuAspValSerSerSerGluProAspThrThrAla 600 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
19939 GAGAGAAGTCCGGTTATTCAAACTCTAGATGTTTCTTCTAGTGAGCCAGATACAACAGCT 19998 

601 SerTrpVallleHisProAlaGluThrlleProThrValSerLysThrThrProAsnPhe 620 

19999 TCATGGGTTATCCATCCTGCAGAGACCATCCCAACTGTTTCCAAGACAACCCCCAATTTT 20058 

621 PheHisSerGluLeuAspThrValSerSerThrAlaThrSerHisGlyAlaAspValSer 640 

2 0059 TTCCACAGTGAATTAGACACTGTATCTTCCACAGCCACCAGTCATGGGGCAGACGTCAGC 20118 

6 41 SerAlalleProThrAsnlleSerProSerGluLeuAspAlaLeuThrProLeuValThr 66 0 

2 0119 TCAGCCATTCCAACAAATATCTCACCTAGTGAACTAGATGCACTGACCCCACTGGTCACT 2 017 8 

661 IleSerGlyThrAspThrSerThrThrPheProThrLeuThrLysSerProHisGluThr 680 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
20179 ATTTCGGGGACAGATACTAGTACAACATTCCCAACACTGACTAAGTCCCCACATGAAACA 20238 

681 GluThrArgThrThrTrpLeuThrHisProAlaGluThrSerSerThrlleProArgThr 700 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
20239 GAGACAAGAACCACATGGCTCACTCATCCTGCAGAGACCAGCTCAACTATTCCCAGAACA 20298 

701 IleProAsnPheSerHisHisGluSerAspAlaThrProSerlleAlaThrSerProGly 720 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
20299 ATCCCCAATTTTTCTCATCATGAATCAGATGCCACACCTTCAATAGCCACCAGTCCTGGG 20358 



721 AlaGluThrSerSerAlalleProIleMetThrValSerProGlyAlaGluAspLeuVal 740 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
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2 0359 GCAGAAACCAGTTCAGCTATTCCAATTATGACTGTCTCACCTGGTGCAGAAGATCTGGTG 20418 

741 ThrSerGlnValThrSerSerGlyThrAspArgAsnMetThrlleProThrLeuThrLeu 760 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
20419 ACCTCACAGGTCACTAGTTCTGGCACAGACAGAAATATGACTATTCCAACTTTGACTCTT 20478 

761 SerProGlyGluProLysThrlleAlaSerLeuValThrHisProGluAlaGlnThrSer 780 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
20479 TCTCCTGGTGAACCAAAGACCATAGCCTCATTAGTCACCCATCCTGAAGCACAGACAAGT 20538 

781 SerAlalleProThrSerThrlleSerProAlaValSerArgLeuValThrSerMetVal 800 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

2 053 9 TCGGCCATTCCAACTTCAACTATCTCGCCTGCTGTATCACGGTTGGTGACCTCAATGGTC 2 0 59 8 

801 ThrSerLeuAlaAlaLysThrSerThrThrAsnArgAlaLeuThrAsnSerProGlyGlu 82 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
20599 ACCAGTTTGGCGGCAAAGACAAGTACAACTAATCGAGCTCTGACAAACTCCCCTGGTGAA 20658 

821 ProAlaThrThrValSerLeuValThrHisProAlaGlnThrSerProThrValProTrp 840 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
20659 CCAGCTACAACAGTTTCATTGGTCACGCATTCTGCACAGACCAGCCCAACAGTTCCCTGG 20718 

841 ThrThrSerllePhePheHisSerLysSerAspThrThrProSerMetThrThrSerHis 860 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
20719 ACAACTTCCATTTTTTTCCATAGTAAATCAGACACCACACCTTCAATGACCACCAGTCAT 20778 

861 GlyAlaGluSerSerSerAlaValProThrProThrValSerThrGluValProGlyVal 88 0 

2 07 79 GGGGCAGAATCCAGTTCAGCTGTTCCAACTCCAACTGTTTCAACTGAGGTACCAGGAGTA 20838 

881 ValThrProLeuValThrSerSerArgAlaVallleSerThrThrlleProIleLeuThr 900 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
20839 GTGACCCCTTTGGTCACCAGTTCTAGGGCAGTGATCAGTACAACTATTCCAATTCTGACT 20898 

901 LeuSerProGlyGluProGluThrThrProSerMetAlaThrSerHisGlyGluGluAla 920 

20899 CTTTCTCCTGGTGAACCAGAGACCACACCTTCAATGGCCACCAGTCATGGGGAAGAAGCC 20958 

921 SerSerAlalleProThrProThrValSerProGlyValProGlyValValThrSerLeu 940 

2 09 59 AGTTCTGCTATTCCAACTCCAACTGTTTCACCTGGGGTACCAGGAGTGGTGACCTCTCTG 21018 

9 41 ValThrSerSerArgAlaValThrSerThrThrlleProIleLeuThrPheSerLeuGly 96 0 

21019 GTCACTAGTTCTAGGGCAGTGACTAGTACA_ACTATTCCAATTCTGACTTTTTCTCTTGGT 210 7 8 

961 GluProGluThrThrProSerMetAlaThrSerHisGlyThrGluAlaGlySerAlaVal 980 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
21079 GAACCAGAGACCACACCTTCAATGGCCACCAGTCATGGGACAGAAGCTGGCTCAGCTGTT 21138 

981 ProThrValLeuProGluValProGlyMetValThrSerLeuValAlaSerSerArgAla 1000 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
21139 CCAACTGTTTTACCTGAGGTACCAGGAATGGTGACCTCTCTGGTTGCTAGTTCTAGGGCA 21198 

1001 ValThrSerThrThrLeuProThrLeuThrLeuSerProGlyGluProGluThrThrPro 1020 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
21199 GTAACCAGTACAACTCTTCCAACTCTGACTCTTTCTCCTGGTGAACCAGAGACCACACCT 21258 



1021 SerMetAlaThrSerHisGlyAlaGluAlaSerSerThrValProThrValSerProGlu 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



Application/Control Number: 10/715,066 



Page 34 



Art Unit: 1643 



212 59 TCAATGGCCACCAGTCATGGGGCAGAAGCCAGCTCAACTGTTCCAACTGTTTCACCTGAG 21318 

1041 ValProGlyValValThrSerLeuValThrSerSerSerGlyValAsnSerThrSerlle 1060 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
21319 GTACCAGGAGTGGTGACCTCTCTGGTCACTAGTTCTAGTGGAGTAAACAGTACAAGTATT 21378 

1061 ProThrLeuIleLeuSerProGlyGluLeuGluThrThrProSerMetAlaThrSerHis 1080 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

213 79 CCAACTCTGATTCTTTCTCCTGGTGAACTAGAAACCACACCTTCAATGGCCACCAGTCAT 21438 

1081 GlyAlaGluAlaSerSerAlaValProThrProThrValSerProGlyValSerGlyVal 1100 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

2143 9 GGGGCAGAAGCCAGCTCAGCTGTTCCAACTCCAACTGTTTCACCTGGGGTATCAGGAGTG 2149 8 

1101 ValThrProLeuValThrSerSerArgAlaValThrSerThrThrlleProIleLeuThr 1120 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
21499 GTGACCCCTCTGGTCACTAGTTCCAGGGCAGTGACCAGTACAACTATTCCAATTCTAACT 21558 

1121 LeuSerSerSerGluProGluThrThrProSerMetAlaThrSerHisGlyValGluAla 1140 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
21559 CTTTCTTCTAGTGAGCCAGAGACCACACCTTCAATGGCCACCAGTCATGGGGTAGAAGCC 21618 

1141 SerSerAlaValLeuThrValSerProGluValProGlyMetValThrSerLeuValThr 1160 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
21619 AGCTCAGCTGTTCTAACTGTTTCACCTGAGGTACCAGGAATGGTGACCTTTCTGGTCACT 21678 

1161 SerSerArgAlaValThrSerThrThrlleProThrLeuThrlleSerSerAspGluPro 1180 

216 79 AGTTCTAGAGCAGTAACCAGTACAACTATTCCAACTCTGACTATTTCTTCTGATGAACCA 21738 

1181 GluThrThrThrSerLeuValThrHisSerGluAlaLysMetlleSerAlalleProThr 1200 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
21739 GAGACCACAACTTCATTGGTCACCCATTCTGAGGCAAAGATGATTTCAGCCATTCCAACT 21798 

1201 LeuAlaValSerProThrValGlnGlyLeuValThrSerLeuValThrSerSerGlySer 1220 

21799 TTAGGTGTCTCCCCTACTGTACAAGGGCTGGTGACTTCACTGGTCACTAGTTCTGGGTCA 21858 

1221 GluThrSerAlaPheSerAsnLeuThrValAlaSerSerGlnProGluThrlleAspSer 1240 

21859 GAGACCAGTGCGTTTTCAAATCTAACTGTTGCCTCAAGTCAACCAGAGACCATAGACTCA 21918 

12 41 TrpValAlaHisProGlyThrGluAlaSerSerValValProThrLeuThrValSerThr 12 6 0 

21919 TGGGTCGCTCATCCTGGGACAGAAGCAAGTTCTGTTGTTCCAACTTTGACTGTCTCCACT 219 7 8 

1261 GlyGluProPheThrAsnlleSerLeuValThrHisProAlaGluSerSerSerThrLeu 12 80 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
219 79 GGTGAGCCGTTTACAAATATCTCATTGGTCACCCATCCTGCAGAGAGTAGCTCAACTCTT 22038 

1281 ProArgThrThrSerArgPheSerHisSerGluLeuAspThrMetProSerThrValThr 1300 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
22039 CCCAGGACAACCTCAAGGTTTTCCCACAGTGAATTAGACACTATGCCTTCTACAGTCACC 22098 

1301 SerProGluAlaGluSerSerSerAlalleSerThrThrlleSerProGlylleProGly 1320 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
22099 AGTCCTGAGGCAGAATCCAGCTCAGCCATTTCAACAACTATTTCACCTGGTATACCAGGT 22158 



1321 ValLeuThrSerLeuValThrSerSerGlyArgAspIleSerAlaThrPheProThrVal 1340 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
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2 2159 GTGCTGACATCACTGGTCACTAGCTCTGGGAGAGACATCAGTGCAACTTTTCCAACAGTG 22218 

1341 ProGluSerProHisGluSerGluAlaThrAlaSerTrpValThrHisProAlaValThr 1360 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
22219 CCTGAGTCCCCACATGAATCAGAGGCAACAGCCTCATGGGTTACTCATCCTGCAGTCACC 22278 

1361 SerThrThrValProArgThrThrProAsnTyrSerHisSerGluProAspThrThrPro 1380 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
222 79 AGCACAACAGTTCCCAGGACAACCCCTAATTATTCTCATAGTGAACCAGACACCACACCA 22338 

1381 SerlleAlaThrSerProGlyAlaGluAlaThrSerAspPheProThrlleThrValSer 1400 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

2 23 3 9 TCAATAGCCACCAGTCCTGGGGCAGAAGCCACTTCAGATTTTCCAACAATAACTGTCTCA 22 3 9 8 

1401 ProAspValProAspMetValThrSerGlnValThrSerSerGlyThrAspThrSerlle 1420 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
22399 CCTGATGTACCAGATATGGTAACCTCACAGGTCACTAGTTCTGGGACAGACACCAGTATA 22458 

1421 ThrlleProThrLeuThrLeuSerSerGlyGluProGluThrThrThrSerPhelleThr 1440 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
22459 ACTATTCCAACTCTGACTCTTTCTTCTGGTGAGCCAGAGACCACAACCTCATTTATCACC 22518 

1441 TyrSerGluThrHisThrSerSerAlalleProThrLeuProValSerProGlyAlaSer 1460 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
22519 TATTCTGAGACACATACAAGTTCAGCCATTCCAACTCTCCCTGTCTCCCCTGATGCATCA 22578 

1461 LysMetLeuThrSerLeuVallleSerSerGlyThrAspSerThrThrThrPheProThr 1480 

225 79 AAGATGCTGACCTCACTGGTCATCAGTTCTGGGACAGACAGCACTACAACTTTCCCAACA 22638 

1481 LeuThrGluThrProTyrGluProGluThrThrAlalleGlnLeuIleHisProAlaGlu 1500 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
22639 CTGACGGAGACCCCATATGAACCAGAGACAACAGCCATACAGCTCATTCATCCTGCAGAG 22698 

1501 ThrAsnThrMetValProArgThrThrProLysPheSerHisSerLysSerAspThrThr 1520 

22699 ACCAACACAATGGTTCCCAGGACAACTCCCAAGTTTTCCCATAGTAAGTCAGACACCACA 22 758 

1521 LeuProValAlalleThrSerProGlyProGluAlaSerSerAlaValSerThrThrThr 1540 

22 759 CTCCCAGTAGCCATCACCAGTCCTGGGCCAGAAGCCAGTTCAGCTGTTTCAACGACAACT 22 818 

15 41 IleSerProAspMetSerAspLeuValThrSerLeuValProSerSerGlyThrAspThr 156 0 

2 2 819 ATCTCACCTGATATGTCAGATCTGGTGACCTCACTGGTCCCTAGTTCTGGGACAGACACC 22 8 7 8 

1561 SerThrThrPheProThrLeuSerGluThrProTyrGluProGluThrThrAlaThrTrp 15 80 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
22879 AGTACAACCTTCCCAACATTGAGTGAGACCCCATATGAACCAGAGACTACAGCCACGTGG 22938 

1581 LeuThrHisProAlaGluThrSerThrThrValSerGlyThrlleProAsnPheSerHis 1600 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
22939 CTCACTCATCCTGCAGAAACCAGCACAACGGTTTCTGGGACAATTCCCAACTTTTCCCAT 22998 

1601 ArgGlySerAspThrAlaProSerMetValThrSerProGlyValAspThrArgSerGly 1620 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
22999 AGGGGATCAGACACTGCACCCTCAATGGTCACCAGTCCTGGAGTAGACACGAGGTCAGGT 23058 



1621 ValProThrThrThrlleProProSerlleProGlyValValThrSerGlnValThrSer 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
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2 3 0 59 GTTCCAACTACAACCATCCCACCCAGTATACCAGGGGTAGTGACCTCACAGGTCACTAGT 23118 

1641 SerAlaThrAspThrSerThrAlalleProThrLeuThrProSerProGlyGluProGlu 1660 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
23119 TCTGCAACAGACACTAGTACAGCTATTCCAACTTTGACTCCTTCTCCTGGTGAACCAGAG 23178 

1661 ThrThrAlaSerSerAlaThrHisProGlyThrGlnThrGlyPheThrValProIleArg 1680 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
23179 ACCACAGCCTCATCAGCTACCCATCCTGGGACACAGACTGGCTTCACTGTTCCAATTCGG 23238 

16 81 ThrValProSerSerGluProAspThrMetAlaSerTrpValThrHisProProGlnThr 1700 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

2 32 3 9 ACTGTTCCCTCTAGTGAGCCAGATACAATGGCTTCCTGGGTCACTCATCCTCCACAGACC 23 2 9 8 

1701 SerThrProValSerArgThrThrSerSerPheSerHisSerSerProAspAlaThrPro 1720 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
23299 AGCACACCTGTTTCCAGAACAACCTCCAGTTTTTCCCATAGTAGTCCAGATGCCACACCT 23358 

1721 ValMetAlaThrSerProArgThrGluAlaSerSerAlaValLeuThrThrlleSerPro 1740 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
23359 GTAATGGCCACCAGTCCTAGGACAGAAGCCAGTTCAGCTGTACTGACAACAATCTCACCT 23418 

1741 GlyAlaProGluMetValThrSerGlnlleThrSerSerGlyAlaAlaThrSerThrThr 1760 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
23419 GGTGCACCAGAGATGGTGACTTCACAGATCACTAGTTCTGGGGCAGCAACCAGTACAACT 23478 

1761 ValProThrLeuThrHisSerProGlyMetProGluThrThrAlaLeuLeuSerThrHis 1780 

234 79 GTTCCAACTTTGACTCATTCTCCTGGTATGCCAGAGACCACAGCCTTATTGAGCACCCAT 2353 8 

1781 ProArgThrGluThrSerLysThrPheProAlaSerThrValPheProGlnValSerGlu 1800 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
23539 CCCAGAACAGAGACAAGTAAAACATTTCCTGCTTCAACTGTGTTTCCTCAAGTATCAGAG 23598 

1801 ThrThrAlaSerLeuThrlleArgProGlyAlaGluThrSerThrAlaLeuProThrGln 1820 

23599 ACCACAGCCTCACTCACCATTAGACCTGGTGCAGAGACTAGCACAGCTCTCCCAACTCAG 23658 

1821 ThrThrSerSerLeuPheThrLeuLeuValThrGlyThrSerArgValAspLeuSerPro 1840 

236 59 ACAACATCCTCTCTCTTCACCCTACTTGTAACTGGAACCAGCAGAGTTGATCTAAGTCCA 23 718 

18 41 ThrAlaSerProGlyValSerAlaLysThrAlaProLeuSerThrHisProGlyThrGlu 186 0 

2 3 719 ACTGCTTCACCTGGTGTTTCTGCAAAAACAGCCCCACTTTCCACCCATCCAGGGACAGAA 23 7 7 8 

1861 ThrSerThrMetlleProThrSerThrLeuSerLeuGlyLeuLeuGluThrThrGlyLeu 1880 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
23779 ACCAGCACAATGATTCCAACTTCAACTCTTTCCCTTGGTTTACTAGAGACTACAGGCTTA 23838 

1881 LeuAlaThrSerSerSerAlaGluThrSerThrSerThrLeuThrLeuThrValSerPro 1900 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
23839 CTGGCCACCAGCTCTTCAGCAGAGACCAGCACGAGTACTCTAACTCTGACTGTTTCCCCT 23898 

1901 AlaValSerGlyLeuSerSerAlaSerlleThrThrAspLysProGlnThrValThrSer 1920 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
23899 GCTGTCTCTGGGCTTTCCAGTGCCTCTATAACAACTGATAAGCCCCAAACTGTGACCTCC 23958 



1921 TrpAsnThrGluThrSerProSerValThrSerValGlyProProGluPheSerArgThr 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
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Db 23959 TGGAACACAGAAACCTCACCATCTGTAACTTCAGTTGGACCCCCAGAATTTTCCAGGACT 24018 

Qy 1941 ValThrGlyThrThrMetThrLeuIleProSerGluMetProThrProProLysThrSer 1960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 24019 GTCACAGGCACCACTATGACCTTGATACCATCAGAGATGCCAACACCACCTAAAACCAGT 24078 

Qy 1961 HisGlyGluGlyValSerProThrThrlleLeuArgThrThrMetValGluAlaThrAsn 1980 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 24079 CATGGAGAAGGAGTGAGTCCAACCACTATCTTGAGAACTACAATGGTTGAAGCCACTAAT 24138 

Qy 1981 LeuAlaThrThrGlySerSerProThrValAlaLysThrThrThrThrPheAsnThrLeu 2000 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2 413 9 TTAGCTACCACAGGTTCCAGTCCCACTGTGGCCAAGACAACAACCACCTTCAATACACTG 24198 

Qy 2001 Ala 2001 
I I I 

Db 24199 GOT 24201 

C. Alignment of amino acids 19,152-21,152 of SEQ ID NO: 5 to the sequences of Veni 

et al . 

AFS86217 

ID AFS86217 standard; DNA; 568 BP. 
XX 

AC AFS86217; 
XX 

DT 20-SEP-2007 (first entry) 
XX 

DE Human transcript sequence, SEQ ID 5616. 
XX 

KW DNA detection; RNA detection; exon; ds . 
XX 

OS Homo sapiens . 
XX 

PN WO200268579-A2 . 
XX 

PD 06-SEP-2002. 
XX 

PF 1U-JA1I-2UU2; 2 00 2 /JO-UG 00 02 84 . 
XX 

PR lO-JAN-2001; 2001US-00756696 . 
XX 

PA (PEKE ) PE CORP NY. 

XX 

PI Venter CJ, Adams M, Li PWD, Myers EW; 

XX 

DR WPI; 2002-682812/73. 

XX 

PT New isolated nucleic acid detection reagent for detecting the presence of 

PT specified human exons. 

XX 

PS Claim 4; SEQ ID NO 5616; 40pp; English. 

XX 

CC The present invention relates to a novel isolated nucleic acid detection 

CC reagent for detecting the presence of specified human exons. The exon 

CC sequences cover every identified human transcript and exon comprising 

CC every gene/coding region of the human genome. The present sequence is one 

CC such exon sequence. The nucleic acid detection agent is used for 

CC detecting the presence of at least 100000, at least 2000, at least 50000 

CC or at least 10000 human exons. The sequences that span exon-exon 

CC junctions eliminate false signals caused by genomic contamination. This 
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CC is because a detection element comprising two neighboring exons as one 

CC contiguous sequence will not hybridize to genomic DNA comprising 

CC intervening intronic DNA. These detection elements will only hybridize to 

CC expressed mRNA transcripts in which the exons are connected and the 

CC intronic sequence has been removed, therefore forming one contiguous 

CC stretch of sequence corresponding to the sequence of the detection 

CC element that spans the exon-exon junction. 

XX 

SQ Sequence 568 BP; 149 A; 168 C; 131 G; 120 T; 0 U; 0 Other; 



Pred. No.: 2.06e-59 Length: 

Score: 996.00 Matches: 

Percent Similarity: 100.0% Conservative 

Best Local Similarity: 100.0% Mismatches: 

Query Match: 6.7% Indels: 

DB: 7 Gaps: 



US-10-715-066A-5_COPY_19152_22152 (1-3001) x AFS86217 (1-568) 



2814 ThrGlnHisPheTyrLeuAsnPheThrlleThrAsnLeuProTyrSerGlnAspLysAla 2833 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
2 ACCCAGCACTTCTACCTGAATTTCACCATCACCAACCTACCATATTCCCAGGACAAAGCC 61 



2834 GlnProGlyThrThrAsnTyrGlnArgAsnLysArgAsnlleGluAspAlaLeuAsnGln 2853 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

62 CAGCCAGGCACCACCAATTACCAGAGGAACAAAAGGAATATTGAGGATGCGCTCAACCAA 121 



2854 LeuPheArgAsnSerSerlleLysSerTyrPheSerAspCysGlnValSerThrPheArg 28 73 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
122 CTCTTCCGAAACAGCAGCATCAAGAGTTATTTTTCTGACTGTCAAGTTTCAACATTCAGG 181 



2874 SerValProAsnArgHisHisThrGlyValAspSerLeuCysAsnPheSerProLeuAla 2893 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
182 TCTGTCCCCAACAGGCACCACACCGGGGTGGACTCCCTGTGTAACTTCTCGCCACTGGCT 241 



2894 ArgArgValAspArgValAlalleTyrGluGluPheLeuArgMetThrArgAsnGlyThr 2913 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
242 CGGAGAGTAGACAGAGTTGCCATCTATGAGGAATTTCTGCGGATGACCCGGAATGGTACC 301 



Qy 2914 GlnLeuGlnAsnPheThrLeuAspArgSerSerValLeuValAspGlyTyrSerProAsn 2933 

Db 3 02 CAGCTGCAGAACTTCACCCTGGACAGGAGCAGTGTCCTTGTGGATGGGTATTCTCCCAAC 361 



Qy 2934 ArgAsnGluProLeuThrGlyAsnSerAspLeuProPheTrpAlaVall leLeuI leGly 2953 

Db 3 62 AGAAATGAGCCCTTAACTGGGAATTCTGACCTTCCCTTCTGGGCTGTCATCCTCATCGGC 421 



Qy 2954 LeuAlaGlyLeuLeuGlyLeuIleThrCysLeuIleCysGlyValLeuValThrThrArg 2973 

Db 422 TTGGCAGGACTCCTGGGACTCATCACATGCCTGATCTGCGGTGTCCTGGTGACCACCCGC 481 



Qy 2974 ArgArgLysLysGluGlyGluTyrAsnValGlnGlnGlnCysProGlyTyrTyrGlnSer 2993 

Db 482 CGGCGGAAGAAGGAAGGAGAATACAACGTCCAGCAACAGTGCCCAGGCTACTACCAGTCA 541 



Qy 2994 HisLeuAspLeuGluAspLeuGln 3001 

Db 542 CACCTAGACCTGGAGGATCTGCAA 565 
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Appendix 3 

Alignment of . 



> acids 19,152-21,152 of SEQ ID NO: 5 to the sequences of Algate et 



SQ 



AAH83663; 

25-SEP-2001 (first entry) 

Human ovarian tumour associated polynucleotide sequence SEQ ID NO: 1287. 

Human; ovarian tumour; ovarian cancer; diagnosis; gene therapy; 
immunogenic; vaccine; ss . 

Homo sapiens . 

WO200151513-A2. 

19-JUL-2001. 

16-JAN-2001; 2001WO-US001575 . 
14-JAN-2000; 2000US-0176722P . 
(CORI-) CORIXA CORP. 
Algate PA; 

WPI; 2001-425866/45. 



Novel ovarian ti 
treat and diagnc 



proteins, and nucleic acids encoding them, used 1 
cancers, particularly ovarian cancer. 



Cla: 



ip; English. 



AAH82377 to AAH83878 represent human ovarian 
polynucleotide sequences which encode ovariai 
ovarian tumour protein and polynucleotide sequel 



development of c 
to diagnose the 



.ncer, particularly 
inset and progressic 



:ides 



■associated 
proteins. The 

have cytostatic 



can be used to inhibit the 
cancer. They can also be used 



Sequence 643 BP; 144 A; 151 C; 180 G; 163 T; 0 U; 5 Other; 



Alignment Scores: 
Pred. No. : 

Percent Similarity: 
Best Local Similarity: 
Query Match: 
DB: 



1 .35e-48 

840.50 

96.5% 

96.5% 

5.7% 



Length : 
Matches: 
Conservative : 
Mismatches : 
Indels: 



US-10-715-066A-5_COPY_19152_22152 (1-3001) x AAH83663 (1-643) 

Qy 2747 ThrAsnLeuThrMetAspSerValLeuValThrValLysAlaLeuPheSerSerAsnLeu 2766 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 518 ACCAACTTGACGATGGACTCCGTGTTGGTCACTGTCAAGGCATTGTTCTCCTCCAATTTG 459 
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2767 AspProSerLeuValGluGlnValPheLeuAspLysThrLeuAsnAlaSerPheHisTrp 2786 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

4 58 GACCCCAGCCTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGG 39 9 

2 787 LeuGlySerThrTyrGlnLeuValAspIleHisValThrGluMetGluSerSerValTyr 2 8 06 

3 98 CTGGGCTCCACCTACCAGTTGGTGGACATCCATGTGACAGAAATGGAGTCATCAGTTTAT 339 



Qy 2807 GlnProThrSerSerSerSerThrGlnHisPheTyrLeuAsnPheThrlleThrAsnLeu 2826 

Db 338 CAACCAACAAGCAGCTCCAGCACCCAGCACTTCTACCTGAATTTCACCATCACCAACCTA 279 



Qy 2827 ProTyrSerGlnAspLysAlaGlnProGlyThrThrAsnTyrGlnArgAsnLysArgAsn 2846 

Db 2 78 CCATATTCCCAGGACAAAGCCCAGCCAGGCACCACCAATTACCAGAGGAACAAAAGGAAT 219 



Qy 2847 IleGluAspAla LeuAsnGlnLeuPheArgAsnSerSerlleLys 2861 

Db 218 ATTGAGGATGC-GGTGAGAAGGGGGTGCTCAACCAACTCTTCCGAAACAGCAGCATCAAG 160 



2862 SerTyrPheSerAspCysGlnValSerThrPheArgSerValProAsnArgHisHisThr 2881 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
159 AGTTATTTTTCTGACTGTCAAGTTTCAACATTCAGGTCTGTCCCCAACAGGCACCACACC 100 

2882 GlyValAspSerLeuCysAsnPheSerProLeuAlaArgArgValAspArgValAlalle 2901 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
99 GGGGTGGACTCCCTGTGTAACTTCTCGCCACTGGCTCGGAGAGTAGACAGAGTTGCCATC 40 

2902 TyrGluGluPheLeuArgMetThrArgAsnGlyThrGln 2914 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
39 TATGAGGAATTTCTGCGGATGACCCGGAATGGTACCCAG 1 



